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The present invention 
is a substantially purified 
sortase-transamidase enzyme from 
Gram-positive bacteria, such as 
Staphylococcus aureus* The enzyme 
having a molecular weight of about 
41.000 daltons and catalyzing a 
reaction that covalently cross-links 
the carboxyl terminus of a protein 
having a sorting signal to the 
peptidoglycan of a Gram-positive 
bacterium, the sorting signal having: 
(1) a motif of LPX3X4G therein; (2) 
a substantially hydrophobic domain 
of at least 31 amino acids carboxyl 
to the motif; and (3) a charged tail 
region with at least two positively 
charged residues caiboxyl to the 
substantially hydrophobic domain, 
at least one of the two positively 
charged residues being arginine, 
the two positively charged residues 
being located at residues 31-33 
from the motif, wherein X3 is any 
of the twenty naturally-occurring 
L-amino acids and X4 is selected 
from the group consisting of alanine, 
serine, and threonine, and wherein 
sorting occurs by cleavage between 
the fourth and fifth residues of the 
LPX3X4G motif. Variants of the enzyme, mediods for cloning 
of use of the enzyme, including for screening for antibiotics 
bacteria, are also disclosed. 
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IDENTIFICA-nON OF SOI^TAgE G]ENR 

GOVERNMENT RIGHTS 

This invention was supported by grants from the United States 
government, namely grants from the National Institutes of Health, NIH-NIAID Grant 
Nos. AI 33985 and 38897. Accordingly, the government may have certain rights in 
this invention. 

5 

BACKGROUND OF THE INVENTION 
This invention is directed to an enzyme from Gram-positive bacteria, 
designated sortase-transamidase, nucleic acid segments encoding the enzyme, and 
methods of use of the enzyme. 

10 Human infections caused by Gram-positive bacteria present a medical 

challenge due to the dramatic increase in multiple antibiotic resistance strains in 
recent years. Gram-positive bacteria that can cause serious or fatal infections in 
himians include Staphylococcus, Streptococcus , Enterococcus^ Pneumococcus, 
Bacillus, Actinomyces, Mycobacterium, and Listeria, as well as others. Infections 

15 caused by these pathogens are particularly severe and difficult to treat in 

immunologically compromised patients. These include patients suffering from 
infection with the Hxmian Inununodeficiency Virus (HIV), the virus that causes AIDS* 
as well as patients given irmnune-suppressive ^ents for treatment of cancer or 
autoimmune diseases. In particular, infections caused by various Mycobacterium 

20 species, including M tuberculosis, M, bovis, M. avium, and M. intracellular e, are 
frequently the cause of disease in patients with AIDS. 

Therefore, it is apparent that new target sites for bacterial 
chemotherapy are needed if such pathogenic organisms are to be controlled. 

A unique characteristic of these pathogens and many Gram-positive 

25 bacteria is their surface display of proteins anchored to the cell wall. In fact, many of 
these molecules are known to be involved in essential cellular ftmctions, including 
pathogenesis in a susceptible host. Thus, a possible disruption in this anchoring 
process may prove to be an effective treatment against these disease-causing 
elements. 

30 The anchoring of surface molecules to the cell wall in Gram-positive 

bacteria has been demonstrated to involve a conserved pathway, culminating in 
recognition of a conserved cleavage/anchoring site by some previously 
\mcharacterized cellular machinery. Molecules whose ultimate location is the cell 
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wall must invariably be translocated across the single cellular membrane of these 
organisms. This is mediated for all cell wall anchored proteins by the well studied 
secretory pathway, involving cleavage of an amino-terminal signal peptide by a type I 
signal peptidase. Upon translocation of the molecule out of the cytoplasm, a 
5 mechanism must be present that extracellularly recognizes this protein as a substrate 
for anchoring. This process has been previously shown to involve the carboxyl- 
terminally located cell wall sorting signal, consisting of a highly conserved motif such 
as LPXTG (SEQ ID NO: 1 ), in which X can represent any of the twenty naturally 
occurring L-amino acids, followed by a series of hydrophobic residues and ultimately 

10 a sequence of positively-charged residues. Thus, once amino-terminally modified 
and successfully secreted, a polypeptide with this carboxyl-terminal sequence can 
present itself as a substrate to be processed by the anchoring machinery. At this time, 
cleavage of the sorting signal after the threonine residue is coupled with covalent 
linkage of the remainder of the polypeptide to the free amino group of the 

1 5 pentaglycine crossbridge in the cell wall . 

It is this transpeptidation reaction that anchors mature surface proteins 
so that the peptidoglycan layer, from which point the molecules can serve their 
biological fimctions. Therefore, there is a need to isolate and purify the enzyme that 
catalyzes this reaction. There is also a need to identify the gene encoding such an 

20 enzyme in order that the enzyme can be produced by genetic engineering techniques. 

Additionally, there is also a need to develop new methods for 
displaying proteins or peptides on the surfaces of bacteria. For many purposes, it is 
desirable to display proteins or peptides on the surfaces of bacteria so that the proteins 
or peptides are accessible to the surrounding solution, and can, for example, be bound 

25 by a ligand that is bound specifically by the protein or peptide. In particular, the 
display of proteins on the surface of bacteria is desirable for the preparation of 
vaccines, the linkage of molecules such as antibiotic molecules or diagnostic reagents 
to cells, for screening reagents such as monoclonal antibodies, and for the selection of 
cloned proteins by displaying the cloned proteins, then observing their reaction with 

30 specific reagents such as antibodies. One way of doing this has been with phage 
display (G.P. Smith, "Filamentous Fusion Phage: Novel Expression Vectors that 
Display Cloned Antigens on the Virion Surface," Science 228:13 15-1316 (1985)). 
However, phage display is limited in its practicality, because it requires that the 
protein being displayed to be inserted into a coat protein of filamentous phage and 

35 retain its activity while not distorting the conformation of the coat protein, allowing 
fiinctional virions to be formed. In general, this technique is therefore limited only to 
small peptide and proteins. 
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Therefore, there is a need for a more general method of peptide and 
protein display. 

SUMMARY 

5 The present invention is directed to sortase-transamidase enzymes 

from Gram-positive bacteria, particularly Staphylococcus aureus^ and methods for 
their use, particularly in the areas of drug screening and peptide and protein display. 

One aspect of the present invention is a substantially purified sortase- 
transamidase enzyme from a Gram-positive bacterium, the enzyme catalyzing a 

10 reaction that covalently cross-links the carboxyl terminiis of a protein having a sorting 
signal to the peptidoglycan of a Gram-positive bacterium, the sorting signal having a 
motif of LPX3X4G therein, wherein sorting occurs by cleavage between the fourth and 
fifth residues of the LPX3X4G motif. Typically, the Gram-positive bacterium is a 
species selected from the group consisting of Staphylococcus aureus, S. sobrinus, 

1 5 Enterococcus faecalis^ Streptococcus pyogenes, and Listeria monocytogenes. 
Preferably, the Gram-positive bacterium is S. aureus. The enzyme may be a 
heterooligomer. 

Preferably, the enzyme has at least one submit with a molecular weight 
of about 41,000 daltons and the sorting signal further includes: (2) a substantially 

20 hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and (3) a 
charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 

25 and X4 is selected from the group consisting of alanine, serine, and threonine. 

Preferably, the enzyme includes therein a subunit whose amino acid 
sequence is selected from the group consisting of: (1) D-P-K-L-K-E-I-Y-Q-I-V- 
L-E-S-Q-M-K-A-I-N-E-I-R-P-G-M-T-G-A-E-A-D-A-I-S-R-N-Y-L- 
E-S-K-G-Y-G-K-E-F-G-H-S-L-G-H-G-I-G-L-E-I-H-E-G-P-M-L-A- 

30 R-T-I-Q-D-K-L-Q-V-N-N-C-V-T-V-E-P-G-V-Y-I-E-G-L-G-I-R-I-E- 
D-D-I-L-I-T-E-N-G-C-Q-V-F-T-K-C-T-K-D-L-I-V-L-T (SEQ ID NO: 
2); (2) M-V-K-V-T-D-Y-S-N-S-K-L-G-K-E-I-A-P-E-V-L-S-V-I-A-S- 
I-A-T-S-E-V-E-G-I-T-G-H-F-A-E-L-K-E-T-N-L-E-K-V-S-R-K-N-L- 
S-R-D-L-K-I-E-S-K-E-G-I-Y-I-D-V-Y-C-A-L-K-H-G-V-N-I-S-K-T- 

35 A-N-K-I-Q-T-S-I-F-N-S-I-S-N-M-T-A-I-E-P-K-Q-I-N-I-H-I-T-Q-I- 
V-I-E-K (SEQ ID NO: 31) and (3) sequences incorporating one or more 
conservative amino acid substitutions in SEQ ID N0:2 or SEQ ID NO: 3 1 , wherein 
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the conservative amino acid substitutions are any of the following: (1) any of 
isoleucine, leucine, and valine for any other of these amino acids; (2) aspartic acid for 
glutamic acid and vice versa; (3) glutamine for asparagine and vice versa; and (4) 
serine for threonine and vice versa. 
5 Another aspect of the present invention is a nucleic acid sequence 

encoding this enzyme. In one altemative, the nucleic acid sequence includes therein a 
sequence selected from the group consisting of: (1) 

GATCCTAAACTGAAAGAAATATATCAAATAGTACTTGAATCTCAAATGAA 
AGCAATTAATGAGATTAGACCTGGCATGACTGGTGCAGAAGCTGATGCCA 

10 TTTCAAGAAACTATTTAGAGTCAAAAGGGTATGGAAAAGAATTTGGACAT 
TCACTAGGACATGGTATTGGTTTAGAAATCCATGAAGGGCCAATGCTGGC 
TCGTACGATACAAGATAAACTTCAAGTTAACAACTGTGTTACAGTAGAAC 
CTGGTGTTTATATAGAAGGTTTGGGCGGTATAAGAATAGAAGATGATATT 
TTAATTACAGAAAATGGTTGTCAAGTCTTTACTAAATGCACAAAAGACCTT 

15 ATAGTTTTAACATAA (SEQ ID NO: 28); (2) 

ATGGTCAAAGTAACTGATTATTCAAATTCAAAATTAGGTAAAGTAGAAAT 
AGCGCCAGAAGTGCTATCTGTTATTGCAAGTATAGCTACTTCGGAAGTCG 
AAGGCATCACTGGCCATTTTGCTGAATTAAAAGAAACAAATTTAGA^^ 
GTTAGTCGTAAAAATTTAAGCCGTGATTTAAAAATCGAGAGTAAAGAAGA 

20 TGGCATATATATAGATGTATATTGTGCATTAAAACATGGTAATATTTCAAA 
AACTGCAAACAAAATTCAAACGTCAATTTTTAATTCAATTTCTAATATC 
AGCGATAGAACCTAAGCAAATTAATATTCACATTACACAAATCGTTATTG 
AAAAGTAA (SEQ ID NO: 30); and (3) a sequence complementary to SEQ ID NO: 
28 or SEQ ID NO: 30. In another altemative, the nucleic acid sequence can include a 

25 sequence hybridizing with SEQ ID NO: 28, SEQ ID NO: 30 or a sequence 

complementaiy to SEQ ID NO: 28 or SEQ ID NO: 30 with no greater than about a 
15% mismatch under stringent conditions. Preferably, the degree of mismatch is less 
than about 5%; more preferably, the degree of mismatch is less than about 2%. 

Yet another aspect of the present invention is a vector comprising the 

30 nucleic acid sequence of the present invention operatively linked to at least one 
control sequence that controls the expression or regulation of the nucleic acid 
sequence. 

Yet another aspect of the present invention is a host cell transfected 
with a vector of the present invention. 
35 Another aspect of the present invention is a method for producing a 

substantially purified sortase-transamidase enzyme. The method comprises the steps 
of: 
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(1) culturing a host cell according to the present invention under 
conditions in which the host cell expresses the encoded sortase-transamidase enzyme; 
and 

(2) purifying the expressed enzyme to produce substantially purified 
5 sortase-transamidase enzyme. 

Another aspect of the present invention is a method for screening a 
compound for anti-sortase-transamidase activity. This method is important in 
providing a way to screen for antibiotics that disrupt the sorting reaction and are likely 
to be effective in treating infections caused by Gram-positive bacteria. 
10 In one alternative, the screening method comprises the steps of: 

(1) providing a substantially purified sortase-transamidase enzyme 
according to the present invention; 

(2) performing an assay for sortase-transamidase in the presence and in 
the absence of the compound; and 

15 (3) comparing the activity of the sortase-transamidase enzyme in the 

presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 

In another alternative, the screening method comprises the steps of: 

(1) providing an active fraction of sortase-transamidase enzyme from 
20 a Gram-positive bacterium; 

(2) performing an assay for sortase-transamidase in the presence and in 
the absence of the compound; and 

(3) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 

25 transamidase activity. 

The active fraction of sortase-transamidase activity can be a particulate 
fraction from Staphylococcus aureus. 

The assay for sortase-transamidase enzyme can be performed by 
monitoring the capture of a soluble peptide that is a substrate for the enzyme by its 

30 interaction with an affinity resin. In one alternative, the soluble peptide includes a 
sequence of at least six histidine residues and the affinity resin contains nickel. In 
another alternative, the soluble peptide includes the active site of glutathione S- 
transferase and the affinity resin contains glutathione. In yet another alternative, the 
soluble peptide includes the active site of streptavidin and the affinity resin contains 

35 biotin. In still another altemative, the soluble peptide includes the active site of 
maltose binding protein and the affinity resin contains amylose. 
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Still another aspect of the present invention is an antibody specifically 
binding the sortase-transamidase enzyme of the present invention. 

Yet another aspect of the present mvention is a protein molecule 
comprising a substantially purified sortase-transamidase enzyme according to the 
5 present invention extended at its carboxyl-terminus with a sufficient number of 
histidine residues to allow specific binding of the protein molecule to a nickel- 
sepharose column through the histidine residues added at the carboxyl-terminus. 

Still another aspect of the present invention is a method for displaying 
a polypeptide on the surface of a GramiJositive bacterium comprising the steps of: 
10 (1) expressing a polypeptide having a sorting signal at its carboxy- 

terminal end, the sorting signal having: (a) a motif of LPX3X4G therein; (b) a 
substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(c) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 
1 5 being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 
and X4 is selected from the group consisting of alanine, serine, and threonine; 

(2) forming a reaction mixture including: (i) the expressed 
polypeptide; (ii) a substantially purified sortase-transamidase according to the present 

20 invention; and (iii) a Gram-positive bacterium having a peptidoglycan to which the 
sortase-transamidase can link the.polypeptide; and 

(3) allowing the sortase-transamidase to catalyze a reaction that 
cleaves the polypeptide within the LPX3X4 motif of the sorting signal and covalently 
cross-links the amino-terminal portion of the cleaved polypeptide to the 

25 peptidoglycan to display the polypeptide on the surface of the Gram-positive 
bacterium. 

Another display method according to the present invention comprises: 

(1) cloning a nucleic acid segment encoding a chimeric protein into a 
Gram-positive bacterium to generate a cloned chimeric protein including therein a 

30 carboxyl-terminal sorting signal as described above; 

(2) growing the bacterium into which the nucleic acid segment has 
been cloned to express the cloned chimeric protein to generate a chimeric protein 
including therein a carboxyl-terminal sorting signal; and 

(3) binding the polypeptide covalently to the cell wall by the enzymatic 
35 action of a sortase-transamidase expressed by the Gram-positive bacterium involving 

cleavage of the chimeric protein within the LPX3X4G motif so that the polypeptide is 
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displayed on the svirface of the Gram-positive bacterium in such a way that the 
polypeptide is accessible to a ligand. 

Another aspect of the present invention is a polypeptide displayed on 
the surface of a Gram-positive bacterium by covalent linkage of an amino-acid 
5 sequence of LPX3X4 derived from cleavage of an LPX3X4G motif, wherein X3 is any 
of the twenty naturally-occurring L~amino acids and X4 is selected from the group 
consisting of alanine, serine, and threonine, the polypeptide being displayed on the 
surface of the Gram-positive bacterium in such a way that the polypeptide is 
accessible to a ligand. 
10 Another aspect of the present invention is a covalent complex 

comprising: 

(1) the displayed polypeptide; and 

(2) an antigen or hapten covalently cross-linked to the polypeptide. 
Yet another aspect of the present invention is a method for vaccination 

15 of an animal comprising the step of immunizing the animal with the displayed 
polypeptide to generate an immune response against the displayed polypeptide, or, 
alternatively, with the covalent complex to generate an immune response against the 
antigen or the hapten. 

Still another aspect of the present invention is a method for screening 

20 for expression of a cloned polypeptide comprising the steps of: 

(1) expressing a cloned polypeptide as a chimeric protein having a 
sorting signal at its carboxy-terminal end as described above; 

(2) forming a reaction mixture including: (i) the expressed chimeric 
protein; (ii) a substantially purified sortase-transamidase enzyme according to the 

25 present invention; and (iii) a Gram-positive bacterium having a peptidoglycan to 
which the sortase-transamidase can link the polypeptide through the sorting signal; 

(3) binding the chimeric protein covalently to the cell wall by the 
enzymatic action of a sortase-transamidase expressed by the Gram-positive 
bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 

30 that the polypeptide is displayed on the surface of the Gram-positive bacterium in 
such a way that the polypeptide is accessible to a ligand; and 

(4) reacting the displayed polypeptide with a labeled specific binding 
partner to screen the chimeric protein for reactivity with the labeled specific binding 
partner. 

35 Still another aspect of the present invention is a method for the 

diagnosis or treatment of a bacterial infection caused by a Gram-positive bacterium 
comprising the steps of: 
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(1) conjugating an antibiotic or a detection reagent to a protein 
including therein a carboxyl-terminal sorting signal as described above to produee a 
conjugate; and 

(2) introducing the conjugate to an organism infected with a Gram- 

5 positive bacterixun in order to cause the conjugate to be sorted and covalently cross- 
linked to the cell walls of the bacterium in order to treat or diagnose the infection. 

If an antibiotic is used, typically it is a penicillin, ampicillin, 
vancomycin, gentamicin, streptomycin, a cephalosporin, amikacin, kanamycin, 
neomycin, paromomycin, tobramycin, ciprofloxacin, clindamycin, rifampin, 

1 0 chloramphenicol, norfloxacin, or a derivative of these antibiotics. 

Similarly, another aspect of the present invention is a conjugate 
comprising an antibiotic or a detection reagent covalently conjugated to a protein 
including therein a carboxyl-terminal sorting signal as described above to produce a 
conjugate. In still another aspect of the present invention, a composition comprises 

15 the conjugate with a pharmaceutically acceptable carrier. 

Another aspect of the present invention is a substantially purified 
protein having at least about 50% match with best alignment with the amino acid 
sequences of at least one of the putative Bacillus peptidase (SEQ ID NO: 3), the 
aminopeptidase P of Lactococcus lactis (SEQ ID NO: 4), or the proline dipeptidase of 

20 Lactobacillus delbrueckii lactis (SEQ ID NO: 5) and having sortase-transamidase 
activity. Preferably, the match is at least about 60% in best alignment; more 
preferably, the match is at least about 70% in best alignment. 

Another aspect of the present invention is a substantially purified 
protein having sortase-transamidase activity and a hydrophobicity profile of at least 

25 one subunit of the protein that, determined as the mean absolute value of the 

hydrophobicity difference per residue, differs fi*om the hydrophobicity profile of a 
putative Bacillus peptidase (SEQ ID NO: 3) by no more than about 2 units on the 
hydrophobicity scale. Preferably, the difference is not more than about 1 unit; most 
preferably, it is not more than about 0.5 units. 

30 Another aspect of the present invention is a substantially purified 

protein having sortase-transamidase activity and a hydrophobicity profile of at least 
one subunit of the protein that, determined as the mean absolute value of the 
hydrophobicity difference per residue, differs fi-om the hydrophobicity profile of the 
sequence of SEQ ID NO: 3 1 by no more than about 2 units on the hydrophobicity 

35 scale. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

These and other features, aspects, and advantages of the present 
invention will become better understood with reference to the following description 
5 and accompanymg drawings where: 

Figure 1 is a diagram showing the substantial homology of the amino 
acid sequence of the sortase-transamidase enzyme of Staphylococcus aureus to an 
open reading frame in the genome of Streptococcus pyogenes (SEQ ID NO: 2 & 34); 

Figure 2 is a diagram showing a greater homology of the amino acid 
10 sequence of the sortase-transamidase enzyme of Staphylococcus aureus to the 

carboxyl-terminal portion of the open reading frame in the genome of Streptococcus 
pyogenes (SEQ ID NO: 2 & 34); 

Figure 3 is the DNA sequence of the 5. pyogenes open reading frame 
(SEQ ID NO: 33 & 34); 
15 Figure 4 (SEQ ID NO: 34) is the entire amino acid sequence of the 

protein translated from the entire 5. pyogenes open reading frame; 

Figure 5 (SEQ ID NO: 3) is the amino acid sequence of a putative 
Bacillus peptidase in the GCVT-SPOIIIAA intergenic region; 

Figure 6 is the hydrophobicity profile of the protein whose amino acid 
20 sequence is shown in Figure 5 (SEQ ID NO: 3); 

Figure 7 (SEQ ID NO: 4) is the amino acid sequence of the 
aminopeptidase P of Lactococcus lactis; 

Figure 8 (SEQ ID NO: 5) is the amino acid sequence of the proline 
dipeptidase of Lactobacillus delbrueckii lactis; 
25 Figure 9 is a diagram of the activity of the sortase-transamidase 

enzyme of the present invention; 

Figure 10 (SEQ ID NOS: 28 & 29) is a partial DNA sequence of the 
gene for one of the subxmits of the sortase-transamidase enzyme of S. aureus; 

Figure 1 1 (SEQ ID NO: 2) is the partial carboxyl-terminal amino acid 
30 sequence translated from the DNA sequence of Figure 10 (SEQ ID NOS: 28 & 29); 

Figure 12 (SEQ ID NOS: 30 & 31) is a partial DNA sequence of the 
gene for a second of the subunits of the sortase-transamidase enzyme of S. aureus; 
and 

Figure 13 is the hydrophobicity profile of the protein translated from 
35 the DNA sequence of Figure 12 (SEQ ID NOS: 30 & 31). 
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DEFINITIONS 

As used herein, the terms defined below have the following meanings 
unless otherwise indicated: 

"Nucleic Acid Sequence": the term "nucleic acid sequence" includes 
5 both DNA and RNA unless otherwise specified, and, unless otherwise specified, 
includes both double-stranded and single-stranded nucleic acids. Also included are 
hybrids such as DNA-RNA hybrids. In particular, a reference to DNA includes RNA 
that has either the equivalent base sequence except for the substitution of uracil and 
RNA for thymine in DNA, or has a complementary base sequence except for the 
10 substitution of uracil for thymine, complementarity being determined according to the 
Watson-Crick base pairing rules. Reference to nucleic acid sequences can also 
include modified bases as long as the modifications do not significantly interfere 
either with binding of a ligand such as a protein by the nucleic acid or with Watson- 
Crick base pairing. 

15 "Antibody": as used herein the term "antibody" includes both intact 

antibody molecules of the appropriate specificity, and antibody firagments (including 
Fab, F(ab'), Fv, and F(ab')2), as well as chemically modified intact antibody 
molecules and antibody Augments, including hybrid antibodies assembled by in vitro 
reassociation of subunits. Also included are single-chain antibody molecules 

20 generally denoted by the term sFv and hxmianized antibodies in which some or all of 
the originally non-human constant regions are replaced with constant regions 
originally derived fi-om human antibody sequences. Both polyclonal and monoclonal 
antibodies are included unless otherwise specified. Additionally included are 
modified antibodies or antibodies conjugated to labels or other molecules that do not 

25 block or alter the binding capacity of the antibody. 

DESCRIPTION 

A substantially purified sortase-transamidase enzyme firom Gram- 
positive bacteria, particularly Staphylococcus aureus. 
30 The properties of this enzyme make it a logical target for antibiotic action. This 
enzyme also catalyzes covalent crosslinkage of proteins to the peptidoglycan of 
Gram-positive bacteria. 

I. THE SORTASE-TRANSAMIDASE ENZYME 
35 One aspect of the invention is a substantially purified sortase- 

transamidase enzyme fi:om a Gram-positive bacterium. As used herein, the term 
"substantially purified" means having a specific activity of at least tenfold greater than 
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the sortase-transamidase activity present in a crude extract, lysate, or other state fix)m 
which proteins have not been removed and also in substantial isolation fix)m proteins 
found in association with sortase-transamidase in the cell. 

One subunit of the enzyme has a molecular weight of about 41 ,000 
5 daltons. The enayme catalyzes a reaction that covalently crosslinks the carboxyl- 
terminus of a protein having a sorting signal to the peptidoglycan of the Gram- 
positive bacterium. The sorting signal has: (1) a motif of LPX3X4G therein; (2) a 
substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 

10 substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 3 1-33 
from the motif In this sorting signal, X3 can be any of the twenty naturally-occurring 
L-amino acids. X4 can be alanine, serine, or threonine. Preferably, X4 is threonine. 

The sortase-transamidase is believed to occur in all Gram-positive 

15 bacteria. In particular, the enzyme exists in Mycobacterium, NocardiCy Actinomyces, 
Staphylococcus, Streptococcus, Listeria, Enter ococctds, and Pneumococcus. 
Specifically, the enzyme exists in the following species: Staphylococcus aureus, S. 
sobrinus, Enterococcus faecalis. Streptococcus pyogenes, and Listeria 
monocytogenes. 

20 Preferably the enzyme is isolated from Staphylococcus aureus, 

A. Amino Acid Sequence 

The sortase-transamidase of the present invention includes therein an 
amino acid sequence, in one subunit of the enzyme, of D-P-K-L-K-E-I-Y-Q-I- 
V-L-E-S-Q-M-K-A-I-N-E-I-R-P-G-M-T-G-A-E-A-D-A-I-S-R-N-Y- 
25 L-E-S-K-G-Y-G-K-E-F-G-H-S-L-G-H-G-I-G-L-E-I-H-E-G-P-M-L- 
A-R-P-I-Q-D-K-L-Q-V-N-N-C-V-T-V-E-P-G-V-Y-I-E-G-L-G-G-I- 
R-I-E-D-D-I-L-I-T-E-N-G-C-Q-V-F-T-K-C-T-K-D-L-I-V-L-TpEQ 
ID N0:2). This sequence is at the carboxyl-terminal end of the subunit of the 
enzyme, 

30 The sortase-transamidase of the present invention also includes therein 

an amino acid sequence, in a second subunit of the enzyme, of M-V-K-V-T-D-Y- 
S-N-S-K-L-G-K-E-I-A-P-E-V-L-S-V-I-A-S-I-A-T-S-E-V-E-G-I-T- 
G-H-F-A-E-L-K-E-T-N-L-E-K-V-S-R-K-N-L-S-R-D-L-K-I-E-S-K- 
E-G-I-Y-I-D-V-Y-C-A-L-K-H-G-V-N-I-S-K-T-A-N-K-I-Q-T-S-I-F- 

35 N-S-I-S-N-M-T-A-I-E-P-K-Q-I-N-I-H-I-T-Q-I-V-I-E-K (SEQ ID NO: 
31). 
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Also within the scope of the present invention are substantially purified 
protein molecules that are mutants of the sequence of SEQ ID N0:2 or SEQ ID NO: 
3 1 that preserve the sortase-transamidase activity. In particular, the conservative 
amino acid substitutions can be any of the following: (1) any of isoleucine, leucine, 
5 and valine for any other of these amino acids; (2) aspartic acid for glutamic acid and 
vice versa; (3) glutamine for asparagine and vice versa; and (4) serine for threonine 
and vice versa. 

Other substitutions can also be considered conservative, depending 
upon the environment of the particular amino acid. For example, glycine (G) and 

10 alanine (A) can frequently be interchangeable, as can be alanine and valine (V). 

Methionine (M), which is relatively hydrophobic, can frequently be interchanged with 
leucine and isoleucine, and sometimes with valine. Lysine (K) and arginine (R) are 
frequently interchangeable in locations in which the significant feature of the amino 
acid residue is its charge and the different pK's of these two amino acid residues or 

15 their different sizes are not significant. Still other changes can be considered 
"conservative" in particular environments. For example, if an amino acid on the 
surface of a protein is not involved in a hydrogen bond or salt bridge interaction with 
another molecule, such as another protein subunit or a ligand bound by the protein, 
negatively charged amino acids such as glutamic acid and aspartic acid can be 

20 substituted for by positively charged amino acids such as lysine or arginine and vice 
versa. Histidine (H), which is more weakly basic than arginine or lysine, and is 
partially charged at neutral pH, can sometimes be substituted for these more basic 
amino acids. Additionally, the amides glutamine (Q) and asparagine (N) can 
sometimes be substituted for their carboxylic acid homologues, glutamic acid and 

25 aspartic acid. 

The sortase-transamidase from Staphylococcus aureus has substantial 
homology the amino acid sequence of the first subimit, that of SEQ ID NO: 2, with an 
open reading frame in the genome oi Streptococcus pyogenes^ particularly in the 
amino-terminal region. There is about a 22% match with best alignment over the 

30 entire sequenced region of the S. pyogenes open reading frame, and about a 47% 

match with best alignment over the carboxyl-terminal region of the S. pyogenes open 
reading frame. These matches are shown in Figures 1-2. The DNA sequence of the 
entire S. pyogenes open reading frame is shovm in Figure 3 (SEQ ID NO: 33 & 34). 
The protein translated from the entire S. pyogenes open reading frame has a molecular 

35 weight of about 40,85 1 .43 daltons; its sequence is shown in Figure 4 (SEQ ID NO: 
34). Therefore, another aspect of the present invention is a substantially purified 
protein molecule that has at least one subunit of about 40,000 to about 41,000 daltons 
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in molecular weight, the subunit having at least a 20% match with best aligmnent with 
the 5. pyogenes open reading frame of Figure 2 and that has sortase-transamidase 
activity. Preferably, the subunit has at least a 30% match with best alignment; more 
preferably, at least 50% match with best alignment. 
5 As shown below in the Example, the first 364 bases of a nucleic acid 

segment that complements a temperature-sensitive mutation in the S. aureus sortase- 
transamidase, designated the SM-3 1 7 complementing gene insert, has been identified 
as encoding a protein sequence that is a homologue of a putative Bacillus peptidase in 
the GCVT-SPOIIIAA intergenic region (GenBank Accession No. 1 73 1 048; Y. 

10 Kobayashi et al,). The sequence of this putative peptidase is shown in Figure 5 (SEQ 
ID N0:3) and its hydrophobicity profile is shown in Figure 6. The hydrophobicity is 
calculated according to the method of J. Kyte & R.F. Doolittle, "A Simple Method for 
Displaying the Hydropathic Character of a Protein," J. MoL Biol. 157: 105-132 
(1982). As used herein, the term "hydrophobicity" is the hydrophobicity as calculated 

15 in Kite & Doolittle, supra . 

To a lesser degree of homology, the protein sequence encoded by this 
complementing gene insert is homologous to aminopeptidase P of Lactococcus lactis 
(GenBank Accession No. 1915907; J. Matos). The amino acid sequence of this 
aminopeptidase is shown in Figure 7 (SEQ ID NO: 4). To a still lesser degree of 

20 homology, the protein sequence encoded by this complementing gene insert is 

homologous to the proline dipeptidase of Lactobacillus delbrueckii lactis (GenBank 
Accession No. 1 172066; K. Stucky et al., "Cloning and DNA Sequence Analysis of 
pepQ, a Prolidase Gene fi-om Lactobacillus delbrueckii subsp. lactis and Partial 
Characterization of Its Product," MoL Gen. Genet. 247: 494-500 (1 995)). The amino 

25 acid sequence of this proUne dipeptidase is shown in Figure 8 (SEQ ID N0:5). 

Because of the relatedness of these proteins, another aspect of the 
present invention is a substantially purified protein having at least one subunit with at 
least about 50% match with best alignment with the amino acid sequences of at least 
one of the putative Bacillus peptidase (SEQ ID NO: 3), the aminopeptidase P of 

30 Lactococcus lactis (SEQ ID NO: 4), or the proline dipeptidase of Lactobacillus 
delbrueckii lactis (SEQ ID NO: 5) and having sortase-transamidase activity. 
Preferably, the at least one subunit of the protein has at least about 60% match with 
best alignment vsdth at least one of these sequence; more preferably, the at least one 
subunit of the protein has at least about 70% match with best alignment with at least 

35 one of these sequences. 

Because the hydrophobicity of a protein is a sensitive measure of 
protein structure, another aspect of the invention is a substantially purified protein 
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having sortase-transamidase activity and a hydrophobicity profile for at least one 
subunit of the protein that, determined as the mean absolute value of the 
hydrophobicity difference per residue, differs from the hydrophobicity profile of the 
putative Bacillus peptidase by no more than about 2 units on the hydrophobicity scale 
5 of Kyte & Doolittle, supra . Preferably, the difference is no greater than about 1 unit; 
more preferably, the difference is no greater than about 0.5 units. 

The sortase-transamidase is a cysteine protease. 

B. Activitv of the Sortase-Transamidase 

1 0 The activity of the sortase-transamidase enzyme of the present 

invention is shown, in general, in Figure 9. The enzyme first cleaves a polypeptide 
having a sorting signal within the LPX3X4G motif Cleavage occurs after residue X4, 
normally a threonine; as indicated above, this residue can also be a serine or alanine 
residue. This residue forms a covalent intermediate with the sortase. The next step is 

1 5 the transamidation reaction that transfers the cleaved carboxyl terminus of the protein 
to be sorted to the "NH2 of the pentaglycine crossbridge within the peptidoglycan 
precursor. The peptidoglycan precursor is then incorporated into the cell wall by a 
transglycosylase reaction with the release of undecaprenyl phosphate. The mature 
anchored polypeptide chains are thus linked to the pentaglycine cross bridge in the cell 

20 wall which is tethered to the e-amino side chain of an unsubstituted cell wall 
tetrapeptide. A carboxypeptidase may cleave a D-Ala-D-Ala bond of the 
pentapeptide structure to yield the final branched anchor peptide in the staphylococcal 
cell wall. 

The sorting signal has: (1) a motif of LPX3X4G therein; (2) a 
25 substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region. 

In the motif, X3 can be any of the 20 naturally-occurring L-amino 
acids, X4 can be any of threonine, serine, or alanine. Preferably, X4 is threonine (O. 
Schneewind et al., "Cell Wall Sorting Signals in Surface Proteins of Gram-Positive 
30 Bacteria," EMBO J. 12:4803-481 1 (1993)). 

Preferably, the substantially hydrophobic domain carboxyl to the motif 
includes no more than about 7 charged residues or residues with polar side chains. 
For the purposes of this specification, these residues include the following: aspartic 
acid, glutamic acid, lysine, and arginine as charged residues, and serine, threonine, 
35 glutamine, and asparagine as polar but imcharged residues. Preferably, the sequence 
includes no more than three charged residues. 
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Representative sequences suitable for sorting signals for use with the 
sortase-transamidase of the present invention include, but are not limited to the - 
followmg: E-E-N-P-F-I-G-T-T-V-F-G-G-L-S-L-A-L-G-A-A-L-L-A-G 
(SEQ ID NO: 6), the hydrophobic domain of the staphylococcal protemase (SPA) 
5 sorting signal from Staphylococcus aureus; (2) G-E-E-S-T-N-K-G-M-L-F-G- 
G-L-F-S-I-L-G-L-A-L-L (SEQ ID N0:7), the SNBP signal of S. aureus; (3) D- 
S-S-N-A-Y-L-P-L-L-G-L-V-S-L-T-A-G-F-S-L-L-O-L (SEQ ID NO: 8), 
the SPAA signal of 5. sobrinus, (4) E-K-Q-N-V-L-L-T-V-V-G-S-L-A-A-M- 
L-G-L-A-G-L-G-F (SEQ ID N0:9), the PRGB signal of Enterococcus faecalis, 

10 (5) S-I-G-T-Y-L-F-K-I-G-S-A-A-M-I-G-A-I-G-I-Y-I-V ^SEQ IDNO:10), 
the TEE signal of Streptococcus pyogenes, and (6) D-S-D-N-A-L-Y-L-L-L-G- 
L-L-A-V-G-T-A-M-A-L-T (SEQ ID NO: 1 1 ), the INLA signal of Listeria 
monocytogenes. Other hydrophobic domains can be used as part of the sorting signal. 

The third portion of the sorting signal is a charged tail region with at 

1 5 least two positively charged residues carboxyl to the substantially hydrophobic 

domain. At least one of the two positively charged residues is arginine. The charged 
tail can also contain other charged amino acids, such as lysine. Preferably, the 
charged tail region includes two or more arginine residues. The two positively 
charged residues are located at residues 3 1-33 from the motif Preferably, the two 

20 arginine residues are either in succession or are separated by no more than one 

intervening amino acid. Preferably, the charged tail is at least five amino acids long, 
although four is possible. Among the charged tails that can be used are the following: 
(1) R-R-R-E-L (SEQ ID N0:12), from the SPA signal of S aureus; (2) R-R-N-K- 
K-N-H-K-A (SEQ ID NO: 13), from the SNBP signal of 5. aureus; (3) R-R-K-Q- 

25 D (SEQ ID N0:14), from the SPAA signal of S sobrinus\ (4) K-R-R-K-E-T-K 
(SEQ ID NO: 15), from the PRGB signal ofE. faecalis; (5) K-R-R-K-A (SEQ ID 
NO: 16), from the TEE signal of 5. pyogenes; (6), K-R-R-H-V-A-K-H (SEQ ID 
NO: 17), from the FIM sorting signal of Actinomyces viscosus, and (7) K-R-R-K-S 
(SEQ ID NO: 1 8), from the BAG sorting signal of Streptococcus aglactiae; (8) K-R- 

30 K-E-E-N (SEQ ID NO: 1 9), from the EMM signal of Streptococcus pyogenes. 

Also usable as the charged tail portion of the sorting signal are the 
following sequences produced by mutagenesis from the SPA signal of 5. aureus. 
These include R-R-R-E-S (SEQ ID NO: 20), R-R-R-S-L (SEQ ID NO: 21), R- 
R-S-E-L (SEQ ID NO: 22), R-S-R-E-L (SEQ ID NO: 23) and S-R-R-E-L (SEQ 

35 ID NO: 24). Other charged tails that are usable as part of the sorting signal can be 
derived from a poly serine tail, itself inactive, by replacement of one or more of the 
serine residues with the basic amino acid arginine. These include R-R-S-S-S (SEQ 
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ID NO: 25), R-S-R-S-S (SEQ ID NO:26), and S-R-R-S-S (SEQ ID NO:27). 
Other sorting signals can also be used. 

II. THE GENE ENCODING THE SORTASE-TRANS AMIDASE ENZYME 
5 A. Isolation of the Sortase-Transamidase Enzvtne Gene 

The gene for the sortase-transamidase enzyme in Staphylococcus 
aureus has been isolated. The isolation process is described in detail in the Example 
below; in general, this process comprises: (1) the generation of temperature-sensitive 
mutants through chemical mutagenesis, such as with the DNA modifying agent N- 
10 methyl-N-nitro-N-nitrosoguanidine; (2) Screening for temperature-sensitive 

mutants; (3) screening the temperature-sensitive mutants for a block in protein sorting 
by the use of a construct harboring the staphylococcal enterotoxin B (SEB) gene fused 
to the cell wall sorting signal of staphylococcal Protein A (SPA), to locate mutants 
that accumulate a precursor molecule formed by cleavage of an amino-terminal signal 
15 peptide but that is not then processed by cleavage of the carboxyl-terminal sorting 
signal; (4) generation of a 5. aureus chromosomal library and complementation of the 
temperature-sensitive sorting defect; and (5) sequencing and characterization of the S. 
aureus complementing determinants. 

20 B. Sequence of the Sortase-Transamidase Gene 

The above procedure yielded a partial sequence for one of the subunits 
of the sortase-transamidase including the carboxyl-terminal portion of the gene for 
the first subunit. This sequence is 

GATCCTAAACTGAAAGAAATATATCAAATAGTACTTGAATCTCAAATGAA 
25 AGCAATTAATGAGATTAGACCTGGCATGACTGGTGCAGAAGCTGATGCCA 
TTTCAAGAAACTATTTAGAGTCAAAAGGGTATGGAAAAGAATTTGGACAT 
TCACTAGGACATGGTATTGGTTTAGAAATCCATGAAGGGCCAATGCTGGC 
TCGTACGATACAAGATAAACTTCAAGTTAACAACTGTGTTACAGTAGAAC 
CTGGTGTTTATAGAAGGTTTGGGCGGTATAAGAATAGAAGATGATATTTT 
30 AATTACAGAAAATGGTTGTCAAGTCTTTACTAAATGCACAAAAGACCTTA 
TAGTTTTAACATAA (SEQ ID NO:28 & 29). The last three nucleotides, TAA, of 
this sequence are the stop codon. 

The above procedure further yielded a sequence for a second subimit of 
ATGGTCAAAGTAACTGATTATTCAAATTCAAAATTAGGTAAAGTAGAAAT 
35 AGCGCCAGAAGTGCTATCTGTTATTGCAAGTATAGCTACTTCGGAAGTCG 
AAGGCATCACTGGCCATTTTGCTGAATTAAAAGAAACAAATTTAGAAA/^ 
GTTAGTCGTAAAAATTTAAGCCGTGATTTAAAAATCGAGAGTAAAGAAGA 
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TGGCATATATATAGATGTATATTGTGCATTAAAACATGGTAATATTTCAAA 
AACTGCAAACAAAATTCAAACGTCAATTTTTAATTCAAm 
AGCGATAGAACCTAAGCAAATTAATATTCACATTACACAAATCGTTATTG 
AAAAGTAA (SEQ ID NO: 30 & 3 1) The last three nucleotides of this sequence, 
5 TAA, are the stop codon. 

Accordingly, within the scope of the present invention is a nucleic acid 
sequence encoding a substantially purified sortase-transamidase enzyme fi-om a 
Gram-positive bacterium. The enzyme encoded has at least one subunit with a 
molecular weight of about 41 ,000 daltons and catalyzes a reaction that covalently 
10 cross-links the carboxyl-terminus of a protein having the sorting signal described 
above to the peptidoglycan of a gram-positive bacteriimi. The nucleic acid sequence 
includes therein the sequence of SEQ ID NO: 28 or a sequence complementary to 
SEQ ID NO: 28, or the sequence of SEQ ID NO: 30 or a sequence complementary to 
SEQ ID NO: 30. 

15 Also included within the present invention is a nucleic acid sequence 

encoding a substantially purified sortase-transamidase enzyme fi-om a Gram-positive 
bacterium with at least one subimit with a molecular weight of about 4 1 ,000 daltons, 
where the enzyme catalyzes the cross-linking reaction where the nucleic acid 
sequence hybridizes with at least one of: (1) the sequence of SEQ ID NO: 28; (2) a 

20 sequence complementary to SEQ ID NO: 28; (3) the sequence of SEQ ID NO: 30; or 
(4) a sequence complementary to SEQ ID NO: 30 with no greater than about a 15% 
mismatch under stringent conditions. Preferably, the degree of mismatch is no greater 
than about 5%; most preferably the mismatch is no greater than about 2%. 

Also within the present invention is a nucleic acid sequence encoding a 

25 substantially purified sortase-transamidase enzyme from a Gram-positive bacterium 
where the enzyme has at least one subunit with a molecular weight of about 41,000 
daltons and catalyzes the cross-linking reaction described above involving the sorting 
signal, where the enzyme includes therein an amino acid sequence selected fix)m the 
group consisting of: (1) D-P-K-L-K-E-I-Y-Q-I-V-L-E-S-Q-M-K-A-I-N-E- 

30 I-R-P-D-M-T-G-A-E-A-D-A-I-S-R-N-Y-L-E-S-K-G-Y-G-K-E-F-G- 
H-S-L-G-H-G-I-G-L-E-I-H-E-G-P-M-L-A-R-T-I-Q-D-K-L-Q-V-N- 
N-C-V-T-V-E-P-G-V-Y-I-E-G-L-G-I-R-I-E-D-D-I-L-I-T-E-N-G-C- 
Q-V-F-T-K-C-T-K-D-L-I-V-L-T (SEQ ID N0:2); (2) M-V-K-V-T-D-Y- 
S-N-S-K-L-G-K-E-I-A-P-E-V-L-S-V-l-A-S-I-A-T-S-E-V-E-G-I-T- 

35 G-H-F-A-E-L-K-E-T-N-L-E-K-V-S-R-K-N-L-S-R-D-L-K-I-E-S-K- 
E-G-I-Y-I-D-V-Y-C-A-L-K-H-G-V-N-I-S-K-T-A-N-K-I-Q-T-S-I-F- 
N-S-I-S-N-M-T-A-I-E-P-K-Q-I-N-I-H-i-T-Q-I-V-I-E-K (SEQ ID NO: 
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31); and (3) sequences incorporating one or more conservative amino acid 
substitutions in SEQ ID N0:2 or SEQ ID NO: 31 wherein the conservative amino acid 
substitutions are any of the foUovving: (1) any of isoleucine, leucine and valine for any 
other of these amino acids; (2) aspartic acid for glutamic acid and vice versa; (3) 
5 glutamine for asparagine and vice versa; and (4) serine for threonine and vice versa. 
Alternative nucleic acid sequences can be determined using the standard genetic code; 
the alternative codons are readily determinable for each amino £icid in this sequence. 

Construction of nucleic acid sequences according to the present 
invention can be accomplished by techniques well known in the art, including solid- 
10 phase nucleotide synthesis, the polymerase chain reaction (PGR) technique, reverse 
transcription of DNA from RNA, the use of DNA polymerases and ligases, and other 
techniques. If an amino acid sequence is known, the corresponding nucleic acid 
sequence can be constructed according to the genetic code. 

15 C. Vectors and Host Cells Transformed vnth Vectors 

Another aspect of the invention is a vector comprising a nucleic acid 
sequence according to the present invention operatively linked to at least one control 
sequence that controls the expression or regulation of the nucleic acid sequence. Such 
control sequences are well known in the art and include operators, promoters, 

20 enhancers, promoter-proximal elements and replication origins. The techniques of 
vector construction, including cloning, ligation, gap-filling, the use of the polymerase 
chain reaction (PCR) procedure, solid-state oligonucleotide synthesis, and other 
techniques, are all well known in the art and need not be described further here. 

Another aspect of the present invention is a host cell transfected with a 

25 vector according to the present invention. Among the host cells that can be used are 
gram-positive bacteria such as Staphylococcus aureus. 

Transfection, also known as transformation, is done using standard 
techniques appropriate to the host cell used, particularly Staphylococcus aureus. Such 
techniques are described, for example, in R.P. Novick, "Genetic Systems in 

30 Staphylococci," Meth. Enzvmol. 204: 587-636 ( 1 99 1 ), as well as in 0. Schneewind et 
aL, "Sorting of Protein A to the Staphylococcal Cell Wall," CeU 70: 267-281 (1992). 

III. SORTASE-TRANSAMIDASE AS A TARGET FOR ANTIBIOTIC ACTION 
A. A Site for Antibiotic Action 
35 The reaction carried out by the sortase-transamidase of the present 

invention presents a possible target for a new class of antibiotics to combat medically ' 
relevant infections caused by numerous gram-positive organisms. Because this is a 
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novel site of antibiotic action, these antibiotics have the advantage that resistance by 
the bacterium has not had a chance to develop. 

Such antibiotics can include compounds with structures that mimic the 
cleavage site, such as compounds with a structure similar to methyl 
5 methanethiosulfonate or, more generally, alkyl methanethiosulfonates. The sortase- 
transamidase of the present invention is believed to be a cysteine protease. Other 
antibiotics that may inhibit the activity of the sortase-transamidase in the present 
invention include inhibitors that would be specific for cysteine-modification in a p- 
lactam framework. These inhibitors would have active moieties that would form 

10 mixed disulfides with the cysteine sulfhydryl. These active moieties could be 

derivatives of methanethiosiilfonate, such as methanethiosulfonate ethylammonium, 
methanethiosulfonate ethy Itrimethylanmionium, or methanethiosulfonate 
ethylsulfonate (J.A. Javitch et al., "Mapping the Binding Site Crevice of the 
Dopamine D2 Receptor by the Substituted-Cysteine Accessibility Method." Neuron . 

15 14: 825-83 1 (1995); M.H. Akabas & A. Karlin, "Identification of Acetylcholine 
Receptor Channel-Lining Residues in the Ml Segment of the a-Subvmit," 
Biochemistry 34: 12496-12500(1995)). Similar reagents, such as alkyl 
alkanethiosulfonates, i.e., methyl methanethiosulfonate, or alkoxycarbonylalkyl 
disulfides, have been described (D.J. Smith et al., "Simple Alkanethiol Groups for 

20 Temporary Blocking of Sulfhydryl Groups of Enzymes." Biochemistry 14: 766-771 
(1975); W.N, Valentine & D.E. Paglia, "Effect of Chemical Modification of 
Sulfliydryl Groups of Human Erythrocyte Enzymes," Am. J. Hematol. 1 1 : 1 1 1-124 
(1981)). Other usefiil inhibitors involve derivatives of 2-trifluoroacety laminobenzene 
sulfonyl fluoride (J.C. Powers, "Proteolytic Enzymes and Their Active-Site-Specific 

25 Inhibitors: Role in the Treatment of Disease," in Modification of ProteinsV in a p- 
lactam fi-amework, peptidyl aldehydes and nitriles (E. Dufour et al., "Peptide 
Aldehydes and Nitriles as Transition State Analog Inhibitors of Cysteine Proteases," 
Biochemistry 34: 9136-9143 (1995); J. O; Westerik & R. Wolfenden, "Aldehydes as 
Inhibitors of Papain," J. Biol. Chem. 247: 8195-8197 (1972)), peptidyl diazomethyl 

30 ketones (L. Bjorck et al., "Bacterial Grov^ Blocked by a Synthetic Peptide Based on 
the Structure of a Human Proteinase Inhibitor," Nature 337: 385-386 (1989)), 
peptidyl phosphonamidates (P.A. Bartlett & C.K. Marlowe, "Phosphonamidates as 
Transition-State Analogue Inhibitors of Themiolysm," Biochemistry 22: 4618-4624 
(1983)), phosphonate monoesters such as derivatives or analogues of m- 

35 carboxyphenyl phenylacetamidomethylphosphonate (R.F. Pratt, "Inhibition of a Class 
C p-Lactamase by a Specific Phosphonate Monoester," Science 246: 917-919 
(1989)), maleimides and their derivatives, including derivatives of such bifimctional 
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maleimides as o-phenylenebismaleimide, E-phenylenebismaleimide, m- 
phenylenebismaleimide, 2,3^aphthalenebismaleimide, 1,5- 
naphthalenebismaleimide, and azophenylbismaleimide, as well as monofunctional 
maleimides and their derivatives (J.V. Moroney et al., "The Distance Between Thiol 

5 Groups in the y Subunit of Coupling Factor 1 Influences the Proton Permeability of 
Thylakoid Membranes.'' J. Bioenerget. Biomembr. 14: 347-359 (1982)), peptidyl 
halomethyl ketones (chloromethyl or fluoromethyl ketones), peptidyl sulfonium salts, 
peptidyl acyloxymethyl ketones, derivatives and analogues of epoxides, such as E-64 
(N-[N-(L-fi2ns-carboxyoxiran-2-carbonyl)-L-leucylagmatme), E-64c (a 

1 0 derivative of E-64 in which the agmatine moiety is replaced by an isoamylamine 
moiety), E-64c ethyl ester, Ep-459 (an analogue of E-64 in which the agmatine 
moiety is replaced by a 1,4-diaminopropyl moiety), Ep-479 (an analogue of E-64 in 
which the agmatine moiety is replaced by a 1 ,7-diheptylamino moiety), Ep-460 (a 
derivative of Ep-459 in which the terminal amino group is substituted with a Z 

1 5 (benzyloxycarbonyl) group), Ep-1 74 (a derivative of E-64 in which the agmatine 
moiety is removed, so that the molecule has a free carboxyl residue from the leucine 
moiety), Ep-475 (an analogue of E-64 in which the agmatine moiety is replaced with 
a NH2-(CH2)2-CH-(CH3)2 moiety), or Ep-420 (a derivative of E-64 in which the 
hydroxyl group is benzoylated, forming an ester, and the leucylagmatine moiety is 

20 replaced with isoleucyl-O-methyltyrosine), or peptidyl 0-acyl hydroxamates (E 
Shaw, "Cysteinyl Proteases and Their Selective Inactivation), pp 271-347). Other 
inhibitors are known in the art. 

B. Screenmg Methods 
25 Another aspect of the present invention is a method for screening a 

compound for anti-sortase-transamidase activity. This is an important aspect of the 
present invention, because it provides a method for screening for compounds that 
disrupt the sorting process and thus have potential antibiotic activity against Gram- 
positive bacteria. 

30 In general, this method comprises the steps of: (1) providing an active 

fraction of sortase-transamidase enzyme; (2) performing an assay for sortase- 
transamidase activity in the presence and in the absence of the compoxmd being 
screened; and (3) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound. 

35 The active fraction of sortase-transamidase enzyme can be a 

substantially purified sortase-transamidase enzyme preparation according to the 
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present invention, but can be a less purified preparation, such as a partially purified 
particulate preparation as described below. 

The enzymatic activity can be measured by the cleavage of a suitable 
substrate, such as the construct having the Staphylococcal Enterotoxin B (SEB) gene 
5 fused to the cell wall sorting signal of Staphylococcal Protein A (SPA). The cleavage 
can be determined by monitoring the molecular weight of the products by sodimn 
dodecyl sulfate-polyacrylamide gel electrophoresis or by other methods. 

One particularly preferred assay for sortase-transamidase activity is the 

following: 

10 Staphylococcal soluble RNA (sRNA) is prepaid from S. aureus by a 

modification of the technique of Zubay (G. Zubay, J. Mol. Biol. 4: 347-356 (1962)). 
An overnight culture of S. aureus is diluted 1 : 1 0 in TSB and incubated at 3TC for 3 
hr. The cells are harvested by centrifugation at 6000 rpm for 15 min. 

For every gram of wet cell pellets, 2 ml of 0.01 M magnesium acetate, 

15 0.001 M Tris, pH 7.5 is used to suspend the pellets. The cell pellets are beaten by 
glass bead beater for 45 minutes in 5 minute intervals. The suspension is centrifuged 
twice at 2500 rpm for 5 minutes to remove the glass beads, then 0.5 ml phenol is 
added to the suspension. The suspension is vigorously shaken for 90 minutes at 4''C, 
and then centrifuged at 1 8,000 x g for 1 5 minutes. The nucleic acids in the top layer 

20 are precipitated by addition of 0. 1 volume of 20% potassium acetate and 2 volumes of 
ethanol, then stored at 4°C for at least 36 hours. The precipitate is obtained by 
centrifugation at 5,000 x g for 5 minutes. Cold NaCl (1 ml) is added to the precipitate 
and stirred at 4°C for 1 hour. The suspension is centrifuged at 15,000 x g for 30 
minutes. The sediments are washed with 0.5 ml of cold 1 M NaCl. The supematants 

25 are combined and 2 volumes of ethanol is added to precipitate the tRNA. The 

precipitate is suspended in 0.1 ml of 0.2 M glycine, pH 10.3 and incubated for 3 hr at 
37*^C. This suspension is then made 0.4 M in NaCl and the RNA is precipitated by 
addition of 2 volumes of ethanol. The precipitate is dissolved in 0.7 ml of 0.3 M 
sodixim acetate, pH 7.0, To this is slowly added 0.5 volume of isopropyl alcohol, with 

30 stirring. The precipitate is removed by centrifugation at 8,000 x g for 5 min. This 
precipitate is redissolved in 0.35 ml of 0.3 M sodiimi acetate, pH 7.0. To this is added 
0.5 volume of isopropyl alcohol, using the same procedure as above. The precipitate 
is also removed by centrifugation. The combined supematants from the two 
centrifugations are treated further with 0.37 ml of isopropyl alcohol. The resulting 

35 precipitate is dissolved in 75 ^il of water and dialyzed against water overnight at 4**C. 
This sRNA is used in the sortase-transamidase assay. 



wo 99/09145 PCT/US98/16229 

-22- 

Particulate sortase-transamidase enzyme is prepared for use in the 
assay by a modification of the procedure of Chatterjee & Park (A.N. Chatterjee &'J.T. 
Park, Proc. Natl. Acad. Sci. USA 51 : 9-16 (1964)). An overnight culture of 5. aureus 
0S2 is diluted 1 :50 in TSB and incubated at 37°C for 3 hr. Cells are harvested by 
5 centrifiigation at 6000 rpm for 1 5 minutes, and washed twice with ice-cold water. 
The cells are disrupted by shaking 7 ml of 1 3% suspension of cells in 0.05 M Tris- 
HCl buffer, pH 7.5, 0.1 mM MgC^, and 1 mM 2-mercaptoethanol with an equal 
volume of glass beads for 10-15 minutes in a beater. The glass beads are removed by 
centrifiigation at 2000 rpm for 5 minutes. The crude extract is then centrifiiged at 
10 15,000 X g for 5 minutes. The supernatant is centrifiiged again at 100,000 x g for 30 
mmutes. The light yellow translucent pellet is resuspended in 2 to 4 ml of 0.02 M 
Tris-HCl buffer, pH 7.5, containing 0.1 mM MgCh and 1 mM 2-mercaptoethanol. 
This suspension represents the crude particulate enzyme and is used in the reaction 
mixture below, 

1 5 The supernatant firom centrifiigation at 1 00,000 x g is passed through 

gel filtration using a Sephadex® G-25 agarose column (Pharmacia) to remove 
endogenous substrates. This supernatant is also used in the reaction mixture. 

The complete reaction mixture contains in a final volume of 30 jxl (M. 
Matsuhashi et ai., Proc. Natl. Acad. Sci. USA 54: 587-594 (1965)): 3 funol of Tris- 

20 HCl, pH 7.8; 0.1 junol of MgCh; 1.3 ^mol of KCl; 2.7 nmol of [^H] glycine (200 
}aCi/|xmol); 2 nmol of UDP-M-pentapeptide; 5 nmol of UDP-N-acetylglucosamine; 
0.2 ^mol of ATP; 0.05 ^mol of potassium phosphoenolpyruvate; 2.05 jig of 
chloramphenicol; 5 ^ig of pyruvate kinase; 0.025 ^mol of 2-mercaptoethanol; 50 ^g 
of staphylococcal sRNA prepared as above; 4 |xg (as protein) of supernatant as 

25 prepared above; 271 |ag of particulate enzyme prepared as above; and 8 nmol of a 
synthesized soluble peptide (HHHHHHAQALEPTGEENPF) (SEQ ID NO: 32) as a 
substrate. 

The mixture is incubated at 20°C for 60 minutes. The mixture is then 
heated at 100°C for 1 minute. The mixture is diluted to 1 ml and precipitated with 50 
30 nl nickel resin, and washed with wash buffer ( 1 % Triton X-1 00, 0. 1 % sodium 
dodecyl sulfate, 50 mM Tris, pH 7.5). The nickel resin beads are counted in a 
scintillation coxmter to determine "^H bound to the beads. 

The effectiveness of the compound being screened to inhibit the 
activity of the sortase-transamidase enzyme can be determined by adding it to the 
35 assay mixture in a predetermined concentration and determining the resulting degree 
of inhibition of enzyme activity that results. Typically, a dose-response curve is 
generated using a range of concentrations of the compound being screened. 
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The particulate enzyme preparation of sortase-transamidase employed 
in this protocol can be replaced with any other sortase-transamidase preparation,- 
purified or crude, staphylococcal, recombinant, or from any other source from any 
other Gram-positive bacterium as described above. 
5 The soluble peptide is captured in this embodiment by its affinity for 

nickel resin as a result of the six histidine residues. More than six histidine residues 
can be used in the peptide. As an alternative, the soluble pieptide can be captured by 
an affinity resulting from other interactions, such as streptavidin-biotin, glutathione 
S-transferase-glutathione, maltose binding protein-amylose, and the like, by 

10 replacing the six histidine residues with the amino acid sequence that constitutes the 
bmdmg site in the peptide and employing the appropriate solid phase affinity resin 
containing the binding partner. Suitable peptides can be prepared by solid phase 
peptide synthesis using techniques well known in the art, such as those described in 
M. Bodanszky, "Peptide Chemistry: A Practical Textbook" (2d ed., Sprmger-Verlag, 

15 Berlin, 1993). For example, if the glutathione S-transferase-glutathione mteraction is 
used, the active site of glutathione S-transferase (D.B. Smith & K.S. Johnson, 
"Single-Step Purification of Polypeptides Expressed in Escherichia coli as Fusions 
with Glutathione S-Transferase " Gene 67: 3 1-40 (1 988)) can be substituted for the 
six histidine residues, and glutathione can be bound to the solid support. 

20 

IV. USE OF SORTASE-TRANSAMIDASE FOR PROTEIN AND PRPTTDR 
DISELAX 

A. Methods for Protein and Peptide Display 

The sortase-transamidase enzyme of the present invention can also be 
25 used in a method of displaying a polypeptide on the surface of a gram-positive 
bacterium. 

In general, a first embodiment of this method comprises the steps of: 
(1) expressing a polypeptide having a sorting signal at its carboxyl-terminal end as 
described above; (2) forming a reaction mixture including: (i) the expressed 

30 polypeptide; (ii) a substantially purified sortase-transamidase enzyme; and (iii) a 
Gram-positive bacterium having a peptidoglycan to which the sortase-transamidase 
can link the polypeptide; and (3) allowing the sortase-transamidase to catalyze a 
reaction that cleaves the polypeptide within the LPX3X4G motif of the sorting signal 
and covalently cross-links the amino-terminal portion of the cleaved polypeptide to 

35 the peptidoglycan to display the polypeptide on the surface of the Gram-positive 
bacterium. 
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In this method, the polypeptide having the sorting signal at its 
carboxy-terminal end need not be expressed in a Gram-positive bacteriimi; it can be 
expressed in another bacterial system such as Escherichia coli or Salmonella 
typhimiirium^ or in a eukaryotic expression system. 
5 The other method for protein targeting and display relies on direct 

expression of the chimeric protein in a Gram-positive bacterium and the action of the 
sortase-transamidase on the expressed protein. In general, such a method comprises 
the steps of: (1) cloning a nucleic acid segment encoding a chimeric protein into a 
Gram-positive bacterium to generate a cloned chimeric protein including therein a 

10 carboxyl-terminal sorting signal as described above, the chimeric protein including 
the polypeptide to be displayed; (2) growing the bacterium into which the nucleic acid 
segment has been cloned to express the cloned chimeric protein to generate a chimeric 
protein including therein a carboxyl-terminal sorting signal; and (3) covalent binding 
of the chimeric protein to the cell wall by the enzymatic action of the sortase- 

1 5 transamidase involving cleavage of the chimeric protein within the LPX3X4G motif so 
that the protein is displayed on the surface of the gram-positive bacterium in such a 
way that the protein is accessible to a ligand. 

Typically, the Gram-positive bacterium is a species of Staphylococcus. 
A particularly preferred species of Staphylococcus is Staphylococcus aureus, 

20 However, other Gram-positive bacteria such as Streptococcus 

pyogenes^ other Streptococcus species, and Gram-positive bacteria of other genera 
can also be used. 

Cloning the nucleic acid segment encoding the chimeric protein into 
the Gram-positive bacterium is performed by standard methods. In general, such 

25 cloning involves: (1) isolation of a nucleic acid segment encoding the protein to be 
sorted and covalently linked to the cell wail; (2) joining the nucleic acid segment to 
the sorting signal; (3) cloning by insertion into a vector compatible with the Gram- 
positive bacterium in which expression is to take place; and (4) incorporation of the 
vector including the new chimeric nucleic acid segment into the bacterium. 

30 Typically, the nucleic acid segment encoding the protein to be sorted is 

DNA; however, the use of RNA in certain cloning steps is within the scope of the 
present invention. 

When dealing with genes from eukaryotic organisms, it is preferred to 
use cDNA, because the natural gene typically contains intervening sequences or 

35 introns that are not translated. Alternatively, if the amino acid sequence is known, a 
synthetic gene encoding the protein to be sorted can be constructed by standard solid- 
phase oligodeoxyribonucleotide synthesis methods, such as the phosphotriester or 
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phosphite triester methods. The sequence of the synthetic gene is determined by the 
genetic code, by which each naturally occurring amino acid is specified by one or 
more codons. Additionally, if a portion of the protein sequence is known, but the 
gene or messenger RNA has not been isolated, the amino acid sequence can be used to 

5 construct a degenerate set of probes according to the known degeneracy of the genetic 
code. General aspects of cloning are described, for example, in J. Sambrook et al., 
"Molecular Cloning: A Laboratory Manual" (2d ed., Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, New York, 1989); in B. Perbal, "A Practical Guide to 
Molecular Cloning" (2d ed., John Wiley & Sons, New York 1988), in S.L. Berger & 

10 A.R. Kimmel, "Guide to Molecular Cloning Techniques" (Methods in Enzymology, 
vol. 152, Academic Press, Inc., San Diego, 1987), and in D.V. Goeddel, ed., "Gene 
Expression Technology" (Methods in Enzymology, vol. 185, Academic Press, Inc., 
San Diego, 1991). 

Once isolated, DNA encoding the protein to be sorted is then joined to 

15 the sorting signal. This is typically accomplished through ligation, such as using 

Escherichia coli or bacteriophage T4 ligase. Conditions for the use of these enzymes 
are well known and are described, for example, in the above general references. 

The ligation is done in such a way so that the protein to be sorted and 
the sorting signal are joined in a single contiguous reading fiame so that a single 

20 protein is produced. This may, in some cases, involve addition or deletion of bases of 
the cloned DNA segment to maintain a single reading fi-ame. This can be done by 
using standard techniques. 

Cloning is typically performed by inserting the cloned DNA into a 
vector containing control elements to allow expression of the cloned DNA. The 

25 vector is then incorporated into the bacterium in which expression is to occur, using 
standard techniques of transformation or other techniques for introducing nucleic 
acids into bacteria. 

One suitable cloning system for S. aureus places the cloned gene under 
the control of the BlaZRI regulon (P.Z. Wang et al., Nucl. Acids Res. 19:4000 

30 ( 1 99 1 )). Vectors and other cloning techniques for use in Staphylococcus aureus are 
described in B. Nilsson & L. Abrahmsen, "Fusion to Staphylococcal Protein A," in 
Gene Expression Technology, supra , p. 144-1 61. 

If the chimeric protein is cloned under control of the BlaZRI regulon, 
expression can be induced by the addition of the p-lactam antibiotic methicillin. 

35 Another aspect of the present invention is a polypeptide displayed on 

the surface of a Gram-positive bacterium by covalent linkage of an amino-acid 
sequence of LPX3X4 derived from cleavage of an LPX3X4G motif, as described above. 
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Yet another aspect of the present invention is a covalent complex 
comprising: (1) the displayed polypeptide; and (2) an antigen or hapten covalently 
cross-linked to the polypeptide. 

5 B. Screening Methods 

These polypeptides associated with the cell surfaces of Giam-positive 
bacteria can be used in various ways for screening. For example, samples of 
expressed proteins from an expression library containing expressed proteins on the 
surfaces of the cells can be used to screen for clones that express a particular desired 
10 protein when a labeled antibody or other labeled specific binding partner for that 
protein is available. 

These methods are based on the methods for protein targeting and 
display described above. 

A first embodiment of such a method comprises: (1 ) expressing a 
15 cloned polypeptide as a chimeric protein having a sorting signal at its carboxy- 
terminal end as described above; (2) forming a reaction mixture including; (i) the 
expressed chimeric protein; (ii) a substantially purified sortase-transamidase enzyme; 
and (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide through the sorting signal; (3) binding of the 
20 chimeric protein covalently to the cell wall by the enzymatic action of a sortase- 
transamidase expressed by the Gram-positive bacterium involving cleavage of the 
chimeric protein within the LPX3X4G motif so that the polypeptide is displayed on the 
surface of the Gram-positive bacterium in such a way that the polypeptide is 
accessible to a ligand; and (4) reacting the displayed polypeptide with a labeled 
25 specific binding partner to screen the chimeric protein for reactivity with the labeled 
specific binding partner. 

The nucleic acid segment encoding the chimeric protein is formed by 
methods well known in the art and can include a spacer. 

In the last step, the cells are merely exposed to the labeled antibody or 
30 other labeled specific binding partner, unreacted antibodies removed as by a wash, and 
label associated with the cells detected by conventional techniques such as 
fluorescence, chemiluminescence, or autoradiography. 

A second embodiment of this method employs expression in a Gram- 
positive bacterium that also produces a sortase-transamidase enzyme. This method 
35 comprises: (1) cloning a nucleic acid segment encoding a chimeric protein into a 
Gram-positive bacterium to generate a cloned chimeric protein including therein a 
carboxyl-terminal sorting signal as described above, the chimeric protein including 
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the polypeptide whose expression is to be screened; (2) growing the bacterium into 
which the nucleic acid segment has been cloned to express the cloned chimeric protein 
to generate a chimeric protein including therein a carboxyl-terminal sorting signal; (3) 
binding the polypeptide covalently to the cell wall by the enzymatic action of a 
5 sortase-transamidase expressed by the Gram-positive bacterium involving cleavage 
of the chimeric protein within the LPX3X4G motif so that the polypeptide is displayed 
on the surface of the Gram-positive bacterium in such a way that the polypeptide is 
accessible to a ligand; and (4) reacting the displayed polypeptide with a labeled 
specific binding partner to screen the chimeric protein for reactivity with the labeled 
1 0 specific binding partner. 

V. USE OF SORTED MOLECULES FOR DIAGNOSIS AND TREATMKNT OF 
BACTERIAL INFECTIONS 

Sorted molecules can also be used for the diagnosis and treatment of 

15 bacterial infections caused by Gram-positive bacteria. Antibiotic molecules or 
fluorescent or any other diagnostic molecules can be chemically linked to a sorted 
peptide segment, which may include a spacer as described above, and then can be 
injected into animals or humans. These molecules are then sorted by the sortase- 
transamidase so that they are covalently linked to the cell wall of the bacteria. 

20 In general, these methods comprise: (1) conjugating an antibiotic or a 

detection reagent to a protein including therein a carboxyl-terminal sorting signal to 
produce a conjugate; and (2) introducing the conjugate to an organism infected with a 
Gram-positive bacterium in order to cause the conjugate to be sorted and covalently 
cross-linked to the cell walls of the bacterium in order to treat or diagnose the 

25 infection. 

The antibiotic used can be, but is not limited to, a penicillin, ampicillin, 
vancomycin, gentamicin, streptomycin, a cephalosporin, amikacin, kanamycin, 
neomycin, paromomycin, tobramycin, ciprofloxacin, clindamycin, rifampin, 
chloramphenicol, or norfloxacin, or a derivative of these antibiotics. 

30 The detection reagent is typically an antibody or other specific binding 

partner labeled with a detectable label, such as a radiolabel. Such methods are well 
known in the art and need not be described further here. 

Accordingly, another aspect of the present invention is a conjugate 
comprising an antibiotic or a detection reagent covalently conjugated to a protein 

35 including therein a carboxyl-terminal sorting signal as described above to produce a 
conjugate. 
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Yet another aspect of the present invention is a composition 
comprising the conjugate and a pharmaceutically acceptable carrier. 

In this context, the conjugates can be administered using conventional 
modes of administration, including, but not limited to, intravenous, intraperitoneal, 
5 oral, or intralymphatic. Other routes of administration can alternatively be used. Oral 
or intraperitoneal admmistration is generally preferred. The composition can be 
administered in a variety of dosage forms, which include, but are not limited to, liquid 
solutions or suspensions, tablets, pills, powders, suppositories, polymeric 
microcapsules or microvesicles, liposomes, and injectable or infiisible solutions. The 

10 preferred form depends on the mode of administration and the quantity administered. 

The compositions for administration preferably also include 
conventional pharmaceutically acceptable carriers and adjuvants known in the art such 
as human serum albumin, ion exchangers, alumina, lecithin, buffered substances such 
as phosphate, glycine, sorbic acid, potassium sorbate, and salts or electrolytes such as 

15 protamine sulfate. The most effective mode of administration and dosage regimen for 
the conjugates as used in the methods in the present invention depend on the severity 
and course of the disease, the patient's health, the response to treatment, the particular 
strain of bacteria infecting the patient, other drugs being administered and the 
development of resistance to them, the accessibility of the site of infection to blood 

20 flow, pharmacokinetic considerations such as the condition of the patient's liver 
and/or kidneys that can affect the metabolism and/or excretion of the administered 
conjugates, and the judgment of the treating physician. According, the dosages shoxdd 
be titrated to the individual patient. 



25 VI. USE OF SORTED POLYPEPTIDES FOR PRODUCTION OF VACCINES 

Additionally, the sorted polypeptides covalently crosslinked to the cell 
walls of Gram-positive bacteria according to the present invention have a number of 
uses. One use is use in the production of vaccines that can be used to generate 
immunity against infectious diseases affecting mammals, including both human and 

30 non-human mammals, such as cattle, sheep, and goats, as well as other animals such 
as poultry and fish. This invention is of special importance to mammals. The 
usefulness of these complexes for vaccine production lies in the fact that the proteins 
are on the surface of the cell wall and are accessible to the medium surrounding the 
bacterial cells, so that the antigenic part of the chimeric protein is accessible to the 

35 antigen processing system. It is well known that presenting antigens in particulate 
. form greatly enhances the immime response. In effect, bacteria containing antigenic 
peptides on the surfaces linked to the bacteria by these covalent interactions function 
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as natural adjuvants. Here follows a representative list of typical microorganisms that 
express polypeptide antigens against which useful antibodies can be prepared by tiie 
methods of the present invention: 

( 1 ) Fungi : Candida albicans, Aspergillus Jumigatus, Histoplasma 
5 capsulatum (all cause disseminating disease), Microsporum canis (animal ringworm). 

(2) Parasitic protozoa: (1) Plasmodium falciparum (malaria), 
Trypanosoma cruzei (sleeping sickness). 

(3) Spirochetes: (1) Borrelia bergdorferi (Lyme disease), Treponema 
pallidum (syphilis), Borrelia recurrentis (relapsing fever), Leptospira 

10 icterohaemorrhagiae (leptospirosis). 

(4) Bacteria: Neisseria gonorrhoeae (gonorrhea). Staphylococcus 
aureus (endocarditis). Streptococcus pyogenes (rheumatic fever), Salmonella typhosa 
(salmonellosis). Hemophilus influenzae (influenza), Bordetella pertussis (whooping 
cough), Actinomyces israelii (actinomycosis), Streptococcus mutans (dental caries), 

15 Streptococcus equi (strangles in horses), Streptococcus agalactiae (bovine mastitis), 
Streptococcus anginosus (canine genital infections). 

(5) Viruses: Human immunodeficiency virus (HIV), polio virus, 
influenza virus, rabies virus, herpes virus, foot and mouth disease virus, psittacosis 
virus, paramyxovirus, myxovirus, coronavirus. 

20 Typically, the lesulting immunological response occurs by both 

humoral and cell-mediated pathways. One possible immunological response is the 
production of antibodies, thereby providing protection against infection by the 
pathogen. 

This method is not limited to protein antigens. As discussed below, 
25 non-protein antigens or haptens can be covalently linked to the C-terminal cell-wall 
targeting segment, which can be produced as an independently expressed polypeptide, 
either alone, or with a spacer at its amino-terminal end. If a spacer at the amino- 
terminal end is used, typically the spacer will have a conformation allowing the 
efiRcient interaction of the non-protein antigen or hapten with the immune system, 
30 most typically a random coil or a-helical form. The spacer can be of any suitable 
length; typically, it is in the range of about 5 to about 30 amino acids; most typically, 
about 10 to about 20 amino acids. In this version of the embodiment, the 
independently expressed polypeptide, once expressed, can then be covalently linked to 
the hapten or non-protein antigen. Typical non-protein antigens or haptens include 
35 drugs, including both drugs of abuse and therapeutic drugs, alkaloids, steroids, 

carbohydrates, aromatic compounds, including many pollutants, and other compounds 
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that can be covalently linked to protein and against which an immune response can be 
raised. 

Ahematively, a protein antigen can be covalently linked to the 
independently expressed cell-wall targeting segment or a cell-wall targeting segment 
5 including a spacer. 

Many methods for covalent linkage of both protein and non-protein 
compounds to proteins are well known in the art and are described, for example, in P. 
Tijssen, "Practice and Theory of Enzyme Immunoassays" ^Isevier, Amsterdam, 
1985), pp. 221-295, and in S.S; Wong, "Chemistry of Protein Conjugation and 

10 Cross-Linking" (CRC Press, Inc., Boca Raton, FL, 1 993). 

Many reactive groups on both protein and non-protein compounds are 
available for conjugation. 

For example, organic moieties containing carboxyl groups or that can 
be carboxylated can be conjugated to proteins via the mixed anhydride method, the 

1 5 carbodiimide method, using dicyclohexylcarbodiimide, and the N- 
hydroxysuccinimide ester method. 

If the organic moiety contains amino groups or reducible nitro groups 
or can be substituted with such groups, conjugation can be achieved by one of several 
techniques. Aromatic amines can be converted to diazonium salts by the slow 

20 addition of nitrous acid and then reacted with proteins at a pH of about 9. If the 

organic moiety contains aliphatic amines, such groups can be conjugated to proteins 
by various methods, including carbodiimide, tolylene-2,4-diisocyanate, or malemide 
compounds, particularly the N-hydroxysuccinimide esters of malemide derivatives. 
An example of such a compound is 4-(N-maleimidomethyl)-cyclohexane-l- 

25 carboxylic acid. Another example is m-maleimidobenzoyl-N-hydroxysuccinimide 
ester. Still another reagent that can be used is N-succinimidyl-3-(2-pyridyldithio) 
propionate. Also, bifunctional esters, such as dimethylpimelimidate, 
dimethyladipimidate, or dunethylsuberimidate, can be used to couple amino-group- 
containing moieties to proteins. 

30 Additionally, aliphatic amines can also be converted to aromatic 

amines by reaction with ^nitrobenzoylchloride and subsequent reduction to a g- 
aminobenzoylamide, which can then be coupled to proteins after diazotization. 

Organic moieties containing hydroxyl groups can be cross-linked by a 
number of indirect procedures. For example, the conversion of an alcohol moiety to 

35 the half ester of succinic acid (hemisuccinate) introduces a carboxyl group available 
for conjugation. The bifimctional reagent sebacoyldichloride converts alcohol to acid 
chloride which, at pH 8.5, reacts readily with proteins. Hydroxyl-containing organic 
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moieties can also be conjugated through the highly reactive chlorocarbonates, 
prepared with an equal molar amount of phosgene. 

For organic moieties containing ketones or aldehydes, such carbonyl- 
containing groups can be derivatized into carboxyl groups through the formation of 
5 O-(carboxymethyl) oximes. Ketone groups can also be derivatized with 

hydrazinobenzoic acid to produce carboxyl groups that can be conjugated to the 
specific binding partner as described above. Organic moieties containing aldehyde 
groups can be directly conjugated through the formation of Schiff bases which are 
then stabilized by a reduction with sodium borohydride. 

10 One particxilarly useful cross-linking agent for hydroxyl-containing 

organic moieties is a photosensitive noncleavable heterobifunctional cross-linking 
reagent, sulfosuccinimidyl 6-[4'-azido-2'-nitrophenylamino] hexanoate. Other 
similar reagents are described in S.S. Wong, "Chemistry of Protein Conjugation and 
Cross-Linking," supra . 

15 Other cross-linking reagents can be used that introduce spacers 

between the organic moiety and the specific binding partner. 

These methods need not be described further here. 

VII. PRODUCTION OF SUBSTANTIALLY PURIFIED SORTASE- 
20 TRANSAMIDASE ENZYME 

Another aspect of the present invention is methods for the production 
of substantially purified sortase-transamidase enzyme. 

A. Methods Involving Expression of Cloned Gene 
25 One method for the production of substantially purified sortase- 

transamidase enzyme involves the expression of the cloned gene. The isolation of the 
nucleic acid segment or segments encoding the sortase-transamidase enzyme is 
described above; these nucleic acid segment or segments are then incorporated into a 
vector and then use to transform a host in which the enzyme can be expressed. In one 
30 altemative, the host is a Gram-positive bacterium. 

The next step in this altemative is expression in a Gram-positive 
bacteriiun to generate the cloned sortase-transamidase enzyme. Expression is 
typically under the control of various control elements associated with the vector 
incorporating the DNA encoding the sortase-transamidase gene; such elements can 
35 include promoters and operators, which can be regulated by proteins such as 
repressors. The conditions required for expression of cloned proteins in gram- 
positive bacteria, particularly S. aureus, are well known in the art and need not be 
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further recited here. An example is the induction of expression of lysostaphin under 
control of the BlaZRI regulon induced by the addition of methicillin. 

When expressed in Staphylococcus aureus, the chimeric protein is 
typically first exported with an amino-terminal leader peptide, such as the 
5 hydrophobic signal peptide at the amino-terminal region of the cloned lysostaphin of 
Recsei et al. (P. Recsei et al., "Cloning, Sequence, and Expression of the Lysostaphin 
Gene from Staphylococcus simulans,'*' Proc, Natl. Acad. Sci. USA 84: 1 1 27-1 1 3 1 
(1987)). 

Alternatively, the cloned nucleic acid segment encoding the sortase- 
10 transamidase enzyme can be inserted in a vector that contains sequences allowing 
expression of the sortase-transamidase in another organism, such as E. coli or S. 
typhimurium, A suitable host organism can then be transformed or transfected with 
the vector containing the cloned nucleic acid segment Expression is then performed 
in that host organism. 

1 5 The expressed enzyme is then purified using standard techniques. 

Techniques for the purification of cloned proteins are well known in the art and need 
not be detailed further here. One particularly suitable method of purification is 
affinity chromatography employing an immobilized antibody tosortase. Other protein 
purification methods include chromatography on ion-exchange resins, gel 

20 electrophoresis, isoelectric focusing, and gel filtration, among others. 

One particularly useful form of affinity chromatography for 
purification of cloned proteins, such as sortase-transamidase, as well as other 
proteins, such as glutathione S-transferase and thioredoxin, that have been extended 
with carboxyl-terminal histidine residues, is chromatography on a nickel-sepharose 

25 colirnm. This allows the purification of a sortase-transamidase enzyme extended at 
its carboxyl terminus with a sufficient number of histidine residues to allow specific 
binding of the protein molecule to the nickel-sepharose colunm through the histidine 
residues. The bound protein is then eluted with imidazole. Typically, six or more 
histidine residues are added; preferably, six histidine residues are added. One way of 

30 adding the histidine residues to a cloned protein, such the sortase-transamidase, is 
through PCR with a primer that includes nucleotides encoding the histidine r-esidues. 
The histidine codons are CAU and CAC expressed as RNA, which are CAT and CAC 
as DN A. Amplification of the cloned DNA with appropriate primers will add the 
histidine residues to yield a new nucleic acid segment, which can be recloned into an 

35 appropriate host for expression of the enzyme extended with the histidine residues. 
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B. Other Methods 

Alternatively, the sortase-transamidase can be purified fi-om Gram- 
positive bacteria by standard methods, including precipitation with reagents such as 
ammonium sulfate or protamine sulfate, ion-exchange chromatography, gel filtration 
5 chromatography, afiFuiity chromatography, isoelectric focusing, and gel 
electrophoresis, as well as other methods known in the art. 

Because the sortase-transamidase is a cysteine protease, one 
particularly useful method of purification involves covalent chromatography by thiol- 
disulfide interchange, using a two-protonic-state gel containing a 2- 
10 mercaptopyridine leaving group, such as Sepharose 2B-glutathione 2-pyridyl 
disulfide or Sepharose 6B-hydroxypropyl 2-pyridyl disulfide. Such covalent 
chromatographic techniques are described in K. Brocklehiirst et al,, "Cysteine 
Proteases," in New Comprehensive Biochemistry. Volume 16: Hvdrolvtic Enzymes 
(A. Neuberger &. K. Brocklehurst, eds., Elsevier, New York, 1987), ch. 2, pp. 39-158. 

15 

VIII. FURTHER APPLICATIONS OF SORTASE-TRANSAMTDASF 

A. Production of Antibodies 

Antibodies can be prepared to the substantially purified sortase- 
transamidase of the present invention, whether the sortase-transamidase is purified 

20 fi-om bacteria or produced fi*om recombinant bacteria as a result of gene cloning 
procedures. Because the substantially purified enzyme according to the present 
invention is a protein, it is an effective antigen, and antibodies can be made by well- 
understood methods such as those disclosed in E. Harlow & D. Lane, "Antibodies: A 
Laboratory Manual" (Cold Spring Harbor Laboratory, 1988). In general, antibody 

25 preparation involves immunizing an antibody-producing animal with the protein, with 
or without an adjuvant such as Freund's complete or incomplete adjuvant, and 
purification of the antibody produced. The resulting polyclonal antibody can be 
purified by techniques such as affinity chromatography. 

Once the polyclonal antibodies are prepared, monoclonal antibodies 

30 can be prepared by standard procedures, such as those described in Chapter 6 of 
Harlow & Lane, supra . 

B. Derivatives for Affinity Chromatography 

Another aspect of the present invention is derivatives of the cloned, 
35 substantially purified sortase-transamidase of the present invention extended at its 
carboxyl terminus with a sufficient mmiber of histidine residues to allow specific 
binding of the protein molecule to a nickel-sepharose colunm through the histidine 
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residues. Typically, six or more histidine residues are added; preferably, six histidine 
residues are added. 

The histidine residues can be added to the carboxyl terminus through 
PGR cloning as described above. 
5 This invention is further described by means of the following example. 

This Example is for illustrative purposes only, and are not to be construed as limiting 
the scope of the invention in any manner. 

EXAMPLE 

10 Identification of Sortase-Transamidase 

Generation of ts Mutants through Chemical Mutagenesis 

To create random mutations in the chromosome, Staphylococcus 
aureus strain 0S2 (RN4220 erm spa-) was mutagenized by exposure to the DNA- 
modifying agent N-methyl-N-nitro-N-nitrosoguanidine. Cultures were incubated 

1 5 with the mutagen for varying periods of time, then placed on TSB agar plates to 

measure viability. Cultures were subsequently plated on TSB+ rifampicin (10 pg/ml) 
to determine the mutation fi-equency based on resistance to the single target site 
antibiotic. Once a maximum mutation frequency was reached, cell cultures were 
exposed to two successive rounds of a penicillin selection (5 ^g/ml at 42'*C) to enrich 

20 for mutants that had a growth effect by lysis of the cells growing at this temperature. 
Mutants were screened for growth at SO^C and 42*C by streaking individual colonies 
on TSB agar plates at the permissive and non-permissive temperatures, respectively. 
These colonies that demonstrated a growth defect at the non-permissive temperature 
were rechecked at 42**C, and subsequently frozen at -80*C in a 5% BSA, 5% 

25 monosodium glutamate (MSG) solution. In this manner, a collection of temperature- 
sensitive mutants was assembled. 



Transformation and Screening of ts Mutants 

In order to isolate mutants that demonstrated a defect in the surface 

30 display of protein, it was necessary to develop a screening process to locate these 
strains. Previous studies that had indicated a typical secretory-dependent process, in 
conjunction with known C-terminal cleavage of translocated proteins, were used to 
elucidate a selection scheme to isolate the desirable mutants. The construct harboring 
the staphylococcal enterotoxin B (SEB) gene fused to the cell wall sorting signal of 

35 staphylococcal protein A (SPA) was used in this assay. This reporter molecule has 
been shown to be properly processed not only by the secretory machinery and through 
signal peptidase cleavage of an N-terminal secretion signal, but also to be correctly 
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sorted after secretion to its peptidoglycan substrate. Mutant cells were made 
competent using 0.5 M sucrose washes, and were transformed with the 
staphylococcal/£sc/zmc/iia coli shuttle vector pOS 1 containing the SEB-SPA 
reporter through electroporation. Transformants were selected by virtue of their 
5 resistance to chloramphenicol as encoded by the plasmid. The cells were then 

screened for properties indicative of a defect in the processing of precursor molecules 
to a fiilly matured and anchored surface protein. 

Cultured cells were induced at 42**C, and then pulsed with S-35 
ProMix (80% Met, 20% Cys), in order to label all synthesized proteins. Samples were 

10 precipitated through acid treatment, then digested with lysostaphin (1 00 jig/ml) and 
subsequently reprecipitated. This was followed by solubilization in hot sodium 
dodecyl sulfate (4%), and an immunoprecipitation vsdth anti-SEB antibodies. 
Samples were run on sodium dodecyl sulfate-polyacrylamide gel electrophoresis and 
finally exposed to phosphorimager evaluation. The resulting banding patterns were 

15 analyzed and quantitated. 

This procedure yielded three anti-SEB reacting species, termed PI, P2 
and M. PI, the largest precursor which migrated to 33 kDa, represents the complete 
gene product encoded by the SEB-SPA construct, with no modifications. P2, the 
second precursor, most likely represents the product after cleavage of the amino- 

20 terminal signal peptide foimd in SEB, and thus migrates to 32 kDa. The smallest 
species, M, is a lysostaphin-solubilized, maturely anchored peptide that has neither 
the signal peptide nor the remainder of the carboxyl-terminal sorting signal after the 
cleavage. This band migrates at approximately 29 kDa. 

Analysis of these species was conducted through phosphorimager 

25 quantitation, and mutants were selected based upon the proposed phenotype that the 
inhibition in sorting would result in an accimiulation of P2 and a reduction in the 
production of M. Through this process, two mutants, SM317 and SM329, were 
earmarked for fiirther analysis due to their elevated ratio of P2/M over wild-type. 

Both mutants finally demonstrated an accumulation of P2 after a 5 min 

30 kinetic analysis, but also clearly showed a decrease in the mutant's ability to degrade 
the species over time as measured by samples chased with cold methionine. These 
results were interpreted to mean that the ability of the mutants to process mature cell 
wall anchored peptides was impaired, quite possibly due to the less efficient activity 
of the sortase-transamidase enzyme. 

35 Protoplasts were made of these mutants in order to cure them of their 

plasmids encoding the reporter construct, and subsequently retransformed with the 
SEB/SPA containing plasmid to once again test for preservation of this phenotype. 
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Upon a favorable result, the mutants were prepared for complementation by a S. 
aureus chromosomal library. 

Generation of an S. aureus Chromosomal Library and Complementation of ts Defect 
5 A staphylococcal library was made through a Sau3 AI digest of the 

chromosomal DNA preparation from 5. aureus strain RN4220. DNA was isolated 
through a phenol-chloroform extraction from lysed cells, and digested for various 
times until the correct partial digest pattern was observed. Fragments greater than 2.5 
kb were inserted into the BamHI cloning site in the multi-cloning sequence (MCS) of 
10 plasmid pC 1 94-MCS. This heterogeneous mixture of plasmids was then transformed 
into competent 0S2 cells. Approximately 15,000 clones were harvested. DNA was 
prepared and transformed into competent cells made of both mutants, and 
simultaneously plated at SO'C and 42**C to screen for complementation of the ts 
mutant phenotype. 

15 Through this process, four chromosomal inserts from each mutant were 

foimd to complement the ts phenotype by conferring growth at 4TC, Due to the 
nature of the mutagenesis, it is at this point necessary to demonstrate definitively that 
the ts defect is somehow linked to the defect in processing. This is done through 
illustration that the plasmids harboring the chromosomal inserts not only complement 

20 the temperature sensitivity, but also relieve the accumulation of P2 at the expense of 
M in these mutants. Therefore, the complemented mutants were screened along with 
the non-complemented versions against wild type 0S2. 

Screen for Sorting Defect Complementation 

25 This assay was conducted differently for each of the two mutants. 

SM-3 1 7 was screened by the insertion of the SEB/SPA fragment aboard the 
replication defective pCL 84 vector that possessed integration capability into the 
chromosome of 5. aureus cells. The site-specific integration, mediated by the 
integrase gene supplied in trans by pCLl 12, disrupts the lipase gene, which can be 

30 assayed for by the lack of hydrolysis on egg yolk agar plates. Once successfully 

integrated, the RN4220 chromosomal fragments that complement the ts mutation can 
be added to make the cells ready for screening. 

SMS 29 was assayed by another approach. The pC 194 plasmid 
harboring the complementing stretch of DNA was fused to an E. coli replicon 

35 pHSG399 that contained the SEB/SPA gene. The shuttle vector thus provided both a 
reporter substrate as well as a complementing activity. 
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The result of this screen demonstrated that over several time periods of 
the pulse-chase, the complementing insert added to the mutant reduced the accretion 
of P2 from the elevated level characteristic of the mutant to the wild type range in 
both mutants, the more dramatically in SM-329. This may be due to the fact that the 
5 substrate for sorting in SM-3 17 is found in only one copy per cell, whereas in SM- 
329, the reporter aboard pC194 is present at approximately 15 copies. Nevertheless, 
these results indicate that these mutants are in the sortase-transamidase gene and the 
sequencing of both chromosomal inserts was therefore undertaken. It should be 
pointed out that the complementing activity of each of the respective mutants was not 
10 transferable to the other, neither in terms of temperature sensitivity nor for the 

processing defect. Also, the four complementing clones isolated from each mutant 
seemed to behave in an identical maimer, and also seemed to possess very similar 
restriction sites when digested with various specific endonucleases. Therefore, one 
clone was chosen from each mutant for sequencing. 

15 

Sequencing and Characterization of the S. aureus Complementing Determinants 

The chromosomal inserts carrying the sorting defect complementing 
capabilities were sequenced. This work was done completely by automated sequence 
analysis using dideoxyribonucleotides. Sequence data was confumed by duplicate 

20 analysis of both strands of DNA. Comparison was done to all known nucleotide and 
protein sequence was currently found in the GenBank service. The partial, crude 
sequence of the S. aureus gene is shown in Figure 10 (SEQ ID NO: 28 & 29). The 
partial carboxy-terminal amino acid sequence of the open reading fi-ame generated 
from the gene sequence of Figure 10 (SEQ ID NO: 28 & 29) is shown in Figure 1 1 

25 (SEQ ID NO: 2). 

Several stretches of high homology were foimd, to both known and 
putative proteins of varying fimction. The first 364 bases of the SM-3 1 7 
complementing gene insert been identified as encoding a protein that is a homologue 
of a putative Bacillus peptidase in the GCVT-SPOIIIAA intergenic region (GenBank 

30 Accession No. 1 73 1 048; Y. Kobayashi et al.). The sequence of this putative peptidase 
is shown in Figure 5 (SEQ ID NO: 3) and its hydrophobicity profile is shown in 
Figure 6. The hydrophobicity is calculated according to the method of J. Kyte & R.F. 
Doolittle, "A Simple Method for Displaying the Hydropathic Character of a Protein," 
J. Mol. Biol. 157: 105-132 (1982). To a lesser degree of homology, the protein 

35 encoded by this complementing gene insert is homologous to aminopeptidase P of 
Lactococcus lactis (GenBank Accession No. 1915907; J. Matos). The amino acid 
sequence of this aminopeptidase is shown in Figure 7 (SEQ ID NO: 4). To a still 
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lesser degree of homology, the protein encoded by this complementing gene insert is 
homologous to the proline dipeptidase of Lactobacillus delbruechi lactis (GenBank 
Accession No. 1 172066; K. Stucky et al., "Cloning and DNA Sequence Analysis of 
pepQ, a Prolidase Gene from Lactobacillus delbrueckii subsp. lactis and Partial 
5 Characterization of Its Product," MoL Gen. Genet. 247: 494-500 (1995). The amino 
acid sequence of this proline dipeptidase is shown in Figure 8 (SEQ ID NO: 5). 

An additional complementing gene insert, in a vector designated 
pCOMPl, has also been sequenced. The DNA sequence of this complementing gene 
insert is shown in Figure 12 (SEQ ID NO: 30), together with the amino acid sequence 

1 0 (SEQ ID NO: 3 1 ) of the protein translated from this DNA sequence, and the 

hydrophobicity profile of the protein translated from the DNA sequence is shown in 
Figure 13. The amino acid sequence of the protein translated from the sequence of 
Figure 12 (SEQ ID NOS: 30 & 31) has virtually no homology with the amino acid 
sequence of the protein shown in Figure 1 1 (SEQ ID NO: 2). In particular, the amino 

15 acid sequence of the protein of Figure 12 (SEQ ID NOS: 30 & 3 1) has a single 
cysteine residue. 

Although Applicants do not intend to be bound by this theory, the 
existence of these two complementing inserts that define different polypeptide 
sequences suggests that the sortase-transamidase enzyme of S. aureus is a 

20 heterooligomer, with two or more different subunits that have different amino acid 
sequences. Mutations in at least two different subunits can give rise to the 
temperature sensitive phenotype and can then be complemented for. Alternatively, the 
synthesis of the peptidoglycan may require additional enzymes. 

Upon completion of the sequencing, and a study of all open reading 

25 frames, candidate genes were selected for further analysis. These genes were 
expressed in the mutants to determine if they complement both the ts and sorting 
defects. Upon success in this capacity, the genes were disrupted in wild-type S. 
aureus to determine their essentiality and possible biological roles. 



30 Materials and Methods 

Mutagenesis of S. aureus Strain 0S2 . S. aureus strain OS2 (RN4220 

erm spa-) was mutagenized by treatment with N-methyl-N-nitro-N- 

nitrosoguanidine at 2 mg/ml. Culture OD was measured at 660 nm for viability. 

After 45 min, cultures were spun down at 4,000 rpm for 10 min, and washed with 
35 citrate buffer, pH 5.5. Cells were resuspended in citrate buffer to a concentration of 

5x10^ cells/ml. These cultures were serially diluted in TSB (tryptic soy broth) and 
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plated on TSB+ rifampicin (10 |iig/ml) agar plates. Mutation frequencies were 
determined to be 5x 1 0^ mutation/cfu. 

Enrichment and Selection of ts Mutants . A 1 : 1 00 dilution of cell 
cultures grown overnight at 30'C was added to TSB, and allowed to grow for 2 hr, at 
5 which time penicillin G was added at 5 ^lg/ml. Culture viability was measured by 
taking OD readings at 660 nm at various times until the concentration dropped to a 
stable pomt. Cells were washed twice in TSB, and the enrichment was repeated a 
second time. 

After two successive rounds of penicillin selection for growth arrest at 

10 42*C, individual colonies were picked and simultaneously streaked on duplicate TSB 
plates incubated at 30**C and 42*'C. Those colonies that had a growth defect at 42*C 
were rechecked at this nonpermissive temperature, and stored at -80*C in a 5% BSA 
5% MSG solution. 

Transformation of Competent Cells . Mutant cells were made 

1 5 competent by diluting overnight cultures 1 : 1 0 in TSB, and growing to 0.3 OD at 660 
nm. Cells were then spun down at 7,500 rpm for 15 min and resuspended in an equal 
volume of 0.5 M sucrose. After another pelleting, cells were resuspended in 0.5 
volume sucrose, and incubated for 30 min at 4'*C. Following another spin, cells were 
brought up in 0.1 volxmie sucrose. These competent cells were then transformed with 

20 the appropriate plasmids encoding chloramphenicol (10 )xg/ml) resistance by 

electroporation at 200 ohms, 25 |iF, and 2.5 kV in 0.2 cm cuvettes. Cells were plated 
on TSB plus chloramphenicol incubated at SO^C. Transformed strains were fi-ozen in 
BSA/MSGat-80"C. 

Pulse-Chase Screen of Mutants . Strains were inoculated in chemically 

25 defined media with chloramphenicol and grown overnight at 30*C. Cultures were 
diluted 1:10 into medixmi IV (O. Schneewind et al., "Sorting of Protein A to the 
Staphylococcal Cell Wall," Cell 70: 267-281 (1992)), grown for 3 hr, and then 
induced at 42*C for 20 min. At this time, cultures were pulse labeled with 50 ^Ci of 
S-35 ProMix for 5 min, and then terminated with 5% trichloroacetic acid (TCA). 

30 Cells were incubated at 4° for 30 min, centrifiiged at 12,500 rpm for 1 5 min, and the 
supematants aspirated. After resuspension in acetone, cells were spim again and 
aspirated to dryness. At this time, cells were treated with lysostaphin (100 (ig/ml) for 
30 min or until noticeable clearing, and subjected to another TCA/acetone 
precipitation. After lysis of cells by boiling for 10 min with 4% SDS in 0.5m Tris, pH 

35 8.0, proteins were inununoprecipitated with anti-SEB for 1 hr and protein A- 

Sepharose beads for another 1 hr. Samples were washed three times in RIP A buffer, 
pH 8.0 (300 mM NaCl, 2% Triton X-100, 1% deoxycholate, 0.2% SDS), and protein 
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was solubilized with SDS-urea sample buffer (50 mM Tris-HCl, pH 6.8, 3% SDS, 
5% 2-mercaptoethanol, 3.5 M urea), with boiling for 10 min. Samples were run on 
SDS-polyacrylamide gel electrophoresis gels and exposed to a phosphorimager screen 
overnight. Quantitations were done on ImageQuant software. 
5 DNA Sequencing . DN A was sequenced on a Perkin-Elmer automated 

sequencer after PGR using dye-terminating ready reaction mixed with SS-Taq 
polymerase. GenBank analysis was done using BLAST software to search the 
database. 

10 ADVANTAGES OF THE PRESENT INVENTTON 

In isolating and characterizing the gene for the S, aureus sortase- 
transamidase enzyme, we have determined the existence of a new site for antibiotic 
action that can be used to screen new antibiotics active against Gram-positive 
pathogens, such as Staphylococcus, Actinomyces, Mycobacterium, Streptococcus, 

15 Bacillus, and other medically important Gram-positive pathogens increasingly 
resistant to conventional antibiotics. The availability of substantially purified X 
aureus sortase-transamidase enzyme provides a method of screening compounds for 
inhibition of the enzyme. 

The purified sortase-transamidase enzyme of the present invention 

20 also yields a method of surface display of peptides and proteins that has advantages 
over phage display, as well as providing methods for producing vaccines against a 
large variety of antigens that can be covalently bound to the surfaces of Gram- 
positive bacteria. 

Although the present invention has been described Avith considerable 
25 detail, with reference to certain preferred versions thereof, other versions and 

embodiments are possible. Therefore, the scope of the invention is determined by the 
following claims. 
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We claim: 

1 . A substantially pxirified sortase-transamidase enzyme from a 
Gram-positive bacterium, the enzyme catalyzing a reaction that covalently cross- 
links the carboxyl terminus of a protein having a sorting signal to the peptidoglycan of 
a Gram-positive bacterium, the sorting signal having a motif of LPX3X4G therein, 
wherein sorting occurs by cleavage between the fourth and fifth residues of the 
LPX3X4G motif. 

2. The substantially purified sortase-transamidase enzyme of claim 1 
wherein the Gram-positive bacterium is a species selected from the group consisting 
of Staphylococcus aureus^ S, sobrinus, Enter ococcus faecalis^ Streptococcus 
pyogenes, and Listeria monocytogenes, 

3. The substantially purified sortase-transamidase enzyme of claim 2 
wherein the Gram-positive bacterium is Staphylococcus aureus. 



4. The substantially purified sortase-transamidase enzyme of claim 1 
20 wherein one subunit of the enzyme has a molecular weight of about 41 ,000 daltons. 



5. The substantially purified sortase-transamidase enzyme of claim 4 
wherein the sorting signal further comprises:(2) a substantially hydrophobic domain of 
at least 31 amino acids carboxyl to the motif; and (3) a charged tail region with at least 

25 two positively charged residues carboxyl to the substantially hydrophobic domain, at 
least one of the two positively charged residues being arginine, the two positively 
charged residues being located at residues 31-33 from the motif, wherein X3 is any of 
the twenty naturally-occurring L-amino acids and X4 is selected from the group 
consisting of alanine, serine, and threonine. 

30 

6. The enzyme of claim 1 wherein the enzyme includes therein an 
amino acid sequence selected from the group consisting of :(1) D-P-K-L-K-E-I- 
Y-Q-I-V-L-E-S-Q-M-K-A-I-N-E-I-R-P-G-M-T-G-A-E-A-D-A-I-S- 
R-N-Y-L-E-S-K-G-Y-G-K-E-F-G-H-S-L-G-H-G-I-G-L-E-I-H-E-G- 

35 P-M-L-A-R-T-I-Q-D-K-L-Q-V-N-N-C-V-T-V-E-P-G-V-Y-I-E-G-L- 
G-I-R-l-E-D-D-I-L-I-T-E-N-G-C-Q-V-F-T-K-C-T-K-D-L-I-V-L-T 
(SEQ IDNO: 2); (2) M-V-K-V-T-D-Y-S-N-S-K-L-G-K-E-I-A-P-E-V-L- 
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S-V-I-A-S-I-A-T-S-E-V-E-G-I-T-G-H-F-A-E-L-K-E-T-N-L-E-K-V- 
S-R-K-N-L-S-R-D-L-K-I-E-S-K-E-G--I-Y-I-D-V-Y-C--A-L-K-H-G- 
V-N-I-S-K-T-A-N-K-I-Q-T-S-I-F-N-S-I-S-N-M-T-A-I-E-P-K-Q-I- 
N-I-H-I-T-Q-I-V-I-E-K (SEQ ID NO: 31); and (3) sequences incorporating one 
5 or more conservative amino acid substitutions in SEQ ID N0:2 or SEQ ID NO: 3 1 , 
wherein the conservative amino acid substitutions are any of the following: (1) any of 
isoleucine, leucine, and valine for any other of these amino acids; (2) aspartic acid for 
glutamic acid and vice versa; (3) glutamine for asparagine and vice versa; and (4) 
serine for threonine and vice versa. 

10 

7. The enzyme of claim 6 wherein the amino acid sequence is D-P- 
K-L-K-E-I-Y-Q-I-V-L-E-S-Q-M-K-A-I-N-E-I-R-P-G-M-TH3"A-^^ 
A-D-A-I-S-R-N-Y-L-E-S-KH3-Y-G-K-E-F-G-H-S-L-G-H-G-I-G-L- 

e-i-h-e-g-p-m-l-a-r-t-i-q-d-k-l-q-v-n-n-c-v-t-v-e-p-g-v- 
15 y-i-eh3-l-g-i-r-i-e-i>"I>i-l-i-t-e-nh><:h5-V"F-t-^ 
l-i-v-l-t (seq id no: 2). 

8. The enzyme of claim 6 wherein the amino acid sequence is M-V- 
K-V-T-D-Y-S-N-S-K-L-G-K-E-FA-P-E-V-L-S-V-I-A-S-I-A-T-S-E- 

20 V-E-G-I-T-G-H-F-A-E-L-K-E-T-N-L-E-K-V-S-R-K-N-L-S-R-D-L- 
K-I-E-S-K-E-G-I-Y-I-D-V-Y-C-A-L-K-H-G-V-N-I-S-K-T-A-N-K-I- 
Q-T-S-I-F-N-S-I-S-N-M-T-A-I-E-P-K-Q-l-N-I-H-I-T-Q-I-V-I-E-K 
(SEQ ID NO: 31), 

25 9. A nucleic acid sequence encoding the enzyme of claim 6, 

10. A nucleic acid sequence encoding the enzyme of claim 7. 

11 . A nucleic acid sequence encoding the enzyme of claim 8. 

30 

12. A nucleic acid sequence encoding a substantially purified sortase- 
transamidase en2yme from a Gram-positive bacterium, the enzyme having a subunit 
with a molecular weight of about 41,000 daltons and catalyzing a reaction that 
covalently cross-links the carboxyl terminus of a protein having a sorting signal to the 

35 peptidoglycan of a Gram-positive bacterium, the sorting signal having: (1) a motif of 
LPX3X4G therein; (2) a substantially hydrophobic domain of at least 31 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
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residues carboxy 1 to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 
located at residues 31-33 from the motif, wherein X3 is any of the twenty naturally- 
occurring L— amino acids and X4 is selected from the group consisting of alanine, 
5 serine, and threonine, and wherem sorting occurs by cleavage between the fourth and 
fiifth residues of the LPX3X4G motif, wherein the nucleic acid sequence includes 
therein a sequence selected from the group consisting of: (1) 

GATCCTAAACTGAAAGAAATATATCAAATAGTACTTGAATCTCAAATGAA 
AGCAATTAATGAGATTAGACCTGGCATGACTGGTGCAGAAGCTGATGCCA 

10 TTTCAAGAAACTATTTAGAGTCAAAAGGGTATGGAAAAGAATTTGGACAT 
TCACTAGGACATGGTATTGGTTTAGAAATCCATGAAGGGCCAATGCTGGC 
TCGTACGATACAAGATAAACTTCAAGTTAACAACTGTGTTACAGTAGAAC 
CTGGTGTTTATATAGAAGGTTTGGGCGGTATAAGAATAGAAGATGATATT 
TTAATTACAGAAAATGGTTGTCAAGTCTTTACTAAATGCACAAAAGACCTT 

15 ATAGTTTTAACATAA (SEQ ID NO: 28); (2) 

ATGGTCAAAGTAACTGATTATTCAAATTCAAAATTAGGTAAAGTAGAAAT 
AGCGCCAGAAGTGCTATCTGTTATTGCAAGTATAGCTACTTCGGAAGTCG 
AAGGCATCACTGGCCATTTTGCTGAATTAAAAGAAACAAATTTAGAA^ 
GTTAGTCGTAAAAATTTAAGCCGTGATTTAAAAATCGAGAGTAAAGAAG^ 
20 TGGCATATATATAGATGTATATTGTGCATTAAAACATGGTAATATTTCAAA 
AACTGCAAACAAAATTCAAACGTCAATTTTTAATTCAATTT^ 
AGCGATAGAACCTAAGCAAATTAATATTCACATTACACAAATCGTTATTG 
AAAAGTAA (SEQ ID NO: 30); or (3) a sequence complementary to SEQ ID NO: 28 
or SEQ ID NO: 30. 

25 

13. A nucleic acid sequence encoding a substantially purified sortase- 
transamidase enzyme from a Gram-positive bacteriiun, the enzyme having a subunit 
with a molecular weight of about 41,000 daltons and catalyzing a reaction that 
covalently cross-links the carboxyl terminus of a protein having a sorting signal to the 

30 peptidoglycan of a Gram-positive bacterium, the sorting signal having (1) a motif of 
LPX3X4G therein; (2) a substantially hydrophobic domain of at least 31 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
residues carboxyl to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 

35 located at residues 31-33 from the motif, wherein X3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 
serine, and threonine, and wherein sorting occurs by cleavage between the fourth and 
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fifth residues of the LPX3X4G motif, wherein the nucleic acid sequence hybridizes 
with a sequence selected from the group consisting of: (1) 

gatcctaaactgaaagaaatatatcaaatagtacttgaatctcaaatgaa 
agcaattaatgagattagacctggcatgactggtgcagaagctgatgcca 

5 tttcaagaaactatttagagtcaaaagggtatggaaaagaatttgg 
tcactaggacatggtattggtttagaaatccatgaagggccaatgctggc 
tcgtacgatacaagataaacttcaagttaacaactgtgttacagtagaac 
ctggtgtttatatagaaggtttgggcggtataagaatagaagatgatatt 
ttaattacagaaaatggttgtcaagtctttactaaatgcacaaaagacctt 

10 atagttttaacataa (seq id no: 28); (2) 

atggtcaaagtaactgattattcaaattcaaaattaggtaaagtagaaat 
agcgccagaagtgctatctgttattgcaagtatagctacttcggaagtcg 
aaggcatcactggccattttgctgaattaaaagaaacaaatttagaaa;^ 
gttagtcgtaaaaatttaagccgtgatttaaaaatcgagagtaaagaaga 
15 tggcatatatatagatgtatattgtgcattaaaacatggtaatatttcaaa 
aactgcaaacaaaattcaaacgtcaatttttaattcaam 
agcgatagaacctaagcaaattaatattcacattacacaaatcgttattg 

AAAAGTAA (SEQ ID NO: 30) or (3) a sequence complementary to SEQ ID NO: 28 
or SEQ ID NO: 30, with no greater than about a 15% mismatch under stringent 
20 conditions. 

14. The nucleic acid sequence of claim 13 wherein the mismatch is no 
greater than about 5%. 

25 15. The nucleic acid sequence of claim 14 wherein the mismatch is no 

greater than about 2%. 

16. A vector comprising the nucleic acid sequence of claim 9 
operatively linked to at least one control sequence that controls the expression or 

30 regulation of the nucleic acid sequence. 

17. A vector comprising the nucleic acid sequence of claim 10 
operatively linked to at least one control sequence that controls the expression or 
regulation of the nucleic acid sequence. 

35 
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18. A vector comprising the nucleic acid sequence of claim 1 1 
operatively linked to at least one control sequence that controls the expression or 
regulation of the nucleic acid sequence. 

5 19. A vector comprising the nucleic acid sequence of claim 12 

operatively linked to at least one control sequence that controls the expression or 
regulation of the nucleic acid sequence. 

20. A vector comprising the nucleic acid sequence of claim 13 

10 operatively linked to at least one control sequence that controls the expression or 
regulation of the nucleic acid sequence. 

2 1 . A host cell transfected with the vector of claim 1 6. 
15 22. A host cell transfected with the vector of claim 17. 

23 . A host cell transfected with the vector of claim 1 8. 

24. A host cell transfected with the vector of claim 19. 

20 

25. A host cell transfected with the vector of claim 20. 



26. A method for producing a substantially purified sortase- 
transamidase enzyme comprising the steps of: 
25 (a) culturing the host cell of claim 21 under conditions in which the 

host cell expresses the encoded sortase-transamidase enzyme; and 

(b) purifying the expressed enzyme to produce substantially purified 
sortase-transamidase enzyme. 



30 27. A method for producing a substantially purified sortase- 

transamidase enzyme comprising the steps of: 

(a) culturing the host cell of claim 22 under conditions in which the 
host cell expresses the encoded sortase-transamidase enzyme; and 

(b) purifying the expressed enzyme to produce substantially purified 
35 sortase-transamidase enzyme. 
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28. A method for producing a substantially purified sortase- 
transamidase enzyme comprising the steps of: 

(a) culturing the host cell of claim 23 under conditions in which the 
host cell expresses the encoded sortase-transamidase enzyme; and 

5 (b) purifying the expressed enzyme to produce substantially purified 

sortase-transamidase enzyme. 

29. A method for producing a substantially purified sortase- 
transamidase enzyme comprising the steps of: 

10 (a) culturing the host cell of claim 24 under conditions in which the 

host cell expresses the encoded sortase-transamidase enzyme; and 

(b) purifying the expressed enzyme to produce substantially purified 
sortase-transamidase enzyme. 

15 30. A method for producing a substantially purified sortase- 

transamidase enzyme comprising the steps of: 

(a) culturing the host cell of claim 25 under conditions in which the 
host cell expresses the encoded sortase-transamidase enzyme; and 

(b) purifying the expressed enzyme to produce substantially purified 
20 sortase-transamidase enzyme. 

3 1 . Substantially purified sortase-transamidase enzyme produced by 
the process of claim 26. 

25 32. Substantially purified sortase-transamidase enzyme produced by 

the process of claim 27. 

33. Substantially purified sortase-transamidase enzyme produced by 
the process of claim 28. 

30 

34. Substantially purified sortase-transamidase enzyme produced by 
the process of claim 29. 

35. Substantially purified sortase-transamidase enzyme produced by 
35 the process of claim 30. 
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36. A method for screening a compound for anti-sortase-transamidase 
activity comprising the steps of: 

(a) providing the substantially purified sortase-transamidase enzyme of 

claim 1; 

(b) performing an assay for sortase-transamidase in the presence and in 
the absence of the compoimd; and 

(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 



37. A method for screening a compound for anti-sortase-transamidase 
activity comprising the steps of: 

(a) providing the substantially purified sortase-transamidase enzyme of 

claim 3; 

1 5 (b) performing an assay for sortase-transamidase in the presence and in 

the absence of the compoxmd; and 

(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 

20 

38. A method for screening a compound for anti-sortase-transamidase 
activity comprising the steps of: 

(a) providing the substantially purified sortase-transamidase enz}ane of 

claim 31; 

25 (b) performing an assay for sortase-transamidase in the presence and in 

the absence of the compound; and 

(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 

30 

39. A method for screening a compound for anti-sortase-transamidase 
activity comprising the steps of: 

(a) providing the substantially purified sortase-transamidase enzyme of 

claim 32; 

35 (b) performing an assay for sortase-transamidase in the presence and in 

the absence of the compoimd; and 
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(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase-? 
transamidase activity. 

5 40, A method for screening a compound for anti-sortase-transamidase 

activity comprising the steps of: 

(a) providing the substantially purified sortase-transamidase enzyme of 

claim 33; 

(b) performing an assay for sortase-transamidase ia the presence and in 
10 the absence of the compound; and 

(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compoimd to screen the compound for sortase- 
transamidase activity. 

15 41, A method for screening a compoimd for anti-sortase-transamidase 

activity comprising the steps of: 

(a) providing the substantially purified sortase-transamidase enzyme of 

claim 34; 

(b) performing ah assay for sortase-transamidase in the presence and in 
20 the absence of the compound; and 

(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compoimd for sortase- 
transamidase activity. 

25 42. A method for screening a compoimd for anti-sortase-transamidase 

activity comprising the steps of: 

(a) providing the substantially purified sortase-transanwdase enzyme of 

claim 35; 

(b) performing an assay for sortase-transamidase in the presence and in 
30 the absence of the compound; and 

(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 
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43. A method for screening a compound for anti-sortase-transamidase 
activity comprising the steps of: 

(a) providing an active fraction of sortase-transamidase enzyme from 
a Gram-positive bacterium; 

(b) performing an assay for sortase-transamidase in the presence and in 
the absence of the compound; and 

(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compoxmd to screen the compound for sortase- 
transamidase activity. 

44. The method of claim 43 wherein the active fraction of sortase- 
transamidase enzyme is a particulate fraction from Staphylococcus aureus, 

45. The method of claim 43 wherein the assay for sortase- 

15 transamidase enzyme is performed by monitoring the capture of a soluble peptide that 
is a substrate for the enzyme by its interaction with an afSnity resin. 

46. The method of claim 45 wherein the soluble peptide includes a 
sequence of at least six histidine residues and the affinity resin contains nickel. 

20 

47. The method of claim 45 wherein the soluble peptide includes the 
active site of glutathione S-transferase and the affinity resin contains glutathione. 

48. The method of claim 45 wherein the soluble peptide includes the 
25 active site of streptavidin and the affinity resin contains biotin. 

49. The method of claim 45 wherein the soluble peptide includes the 
active site of maltose binding protein and the affinity resin contains amy lose. 

30 50. An antibody specifically binding the substantially purified sortase- 

transamidase enzyme of claim 1 . 

5 1 . An antibody specifically binding the substantially purified sortase- 
transamidase enzyme of claim 3. 



35 



52. An antibody specifically binding the substantially purified sortase- 
transamidase enzyme of claim 31. 
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53. An antibody specifically binding the substantially purified sortase- 
transamidase enzyme of claim 32. 

5 54. An antibody specifically binding the substantially purified sortase- 

transamidase enzyme of claim 33. 

55. An antibody specifically binding the substantially purified sortase- 
transamidase enzyme of claim 34. 

10 

56. An antibody specifically binding the substantially purified sortase- 
transamidase enzyme of claim 35. 

57. A protein molecule comprising the substantially purified sortase- 
15 transamidase enzyme of claim 1 extended at its carboxyl-terminus with a sufficient 

number of histidine residues to allow specific binding of the protein molecule to a 
nickel-sepharose column through the histidine residues added at the carboxyl- 
terminus. 

20 58. A protein molecule comprising the substantially purified sortase- 

transamidase enzyme of claim 3 extended at its carboxyl-terminus with a sufficient 
niunber of histidine residues to allow specific binding of the protein molecule to a 
nickel-sepharose column through the histidine residues added at the carboxyl- 
terminus. 

25 

59. A protem molecule comprising the substantially purified sortase- 
transamidase enzyme of claim 3 1 extended at its carboxyl-terminus with a sufficient 
number of histidine residues to allow specific binding of the protein molecule to a 
nickel-sepharose column. 

30 

60. A protein molecule comprising the substantially purified sortase- 
transamidase enzyme of claim 32 extended at its carboxyl-terminus with a stifficient 
number of histidine residues to allow specific binding of the protein molecule to a 
nickel-sepharose column. 



35 
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61 . A protein molecule comprising the substantially purified sortase- 
transamidase enzyme of claim 33 extended at its carboxyl-terminus with a sufficient 
number of histidine residues to allow specific binding of the protein molecule to a 
nickel-sepharose column. 

5 

62. A protein molecule comprising the substantially purified sortase- 
transamidase en2yme of claim 34 extended at its carboxyl-terminus with a sxifficient 
number of histidine residues to allow specific binding of the protein molecule to a 
nickel-sepharose column. 

10 

63. A protein molecule comprising the substantially purified sortase- 
transamidase enzyme of claim 35 extended at its carboxyl-terminus with a sufficient 
number of histidine residues to allow specific binding of the protein molecule to a 
nickel-sepharose column. 

15 

64. A method for displaying a polypeptide on the surface of a Gram- 
positive bacterium comprising the steps of: 

(a) expressing a polypeptide having a sorting signal at its carboxy- 
terminal end, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 

20 substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 31-33 
firom the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 

25 and X4 is selected fi-om the group consisting of alanine, serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed 
polypeptide; (ii) the substantially purified sortase-transamidase of claim 1; and (iii) a 
Gram-positive bacterium having a peptidoglycan to which the sortase-transamidase 
can link the polypeptide; and 

30 (c) allowing the sortase-transamidase to catalyze a reaction that 

cleaves the polypeptide within the LPX3X4 motif of the sorting signal and covalently 
cross-links the amino-terminal portion of the cleaved polypeptide to the 
peptidoglycan to display the polypeptide on the surface of the Gram-positive 
bacterium. 



35 
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65. A method for displaying a polypeptide on the surface of a Gram- 
positive bacterium comprising the steps of: 

(a) expressing a polypeptide having a sorting signal at its carboxy- 
terminal end, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 

5 substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 3 1-33 
from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 
10 and X4 is selected from the group consisting of alanine, serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed 
polypeptide; (ii) the substantially purified sortase-transamidase of claim 3; and (iii) a 
Gram-positive bacterium having a peptidoglycan to which the sortase-transamidase 
can link the polypeptide; and 

15 (c) allowing the sortase-transamidase to catalyze a reaction that 

cleaves the polypeptide within the LPX3X4G motif of the sorting signal and covalently 
cross-links the amino-terminal portion of the cleaved polypeptide to the 
peptidoglycan to display the polypeptide on the surface of the Gram-positive 
bacterium. 

20 

66. A method for displaying a polypeptide on the surface of a Gram- 
positive bacterium comprising the steps of: 

(a) expressing a polypeptide having a sorting signal at its carboxy- 
terminal end, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 

25 substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 

30 and X4 is selected from the group consisting of alanine, serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed 
polypeptide; (ii) the substantially purified sortase-transamidase enzyme of claim 31 ; 
and (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide; and 

35 (c) allowing the sortase-transamidase to catalyze a reaction that 

cleaves the polypeptide within the LPX3X4G motif of the sorting signal and covalently 
cross-links the amino-terminal portion of the cleaved polypeptide to the 
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peptidoglycan to display the polypeptide on the surface of the Gram-positive 
bacterium. 

67. A method for displaying a polypeptide on the surface of a Gram- 
5 positive bacterium comprising the steps of: 

(a) expressing a polypeptide having a sorting signal at its carboxy- 
terminal end, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 
substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 

10 substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 
and X4 is selected from the group consisting of alanine, serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed 

15 polypeptide; (ii) the substantially purified sortase-transamidase enzyme of claim 32; 
and (iii) a Gram-positive bacterixmi having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide; and 

(c) allowing the sortase-transamidase to catalyze a reaction that 
cleaves the polypeptide within the LPX3X4G motif of the sorting signal and covalently 

20 cross-links the amino-terminal portion of the cleaved polypeptide to the 

peptidoglycan to display the polypeptide on the surface of the Gram-positive 
bacterium. 



68. A method for displaying a polypeptide on the surface of a Gram- 
25 positive bacterium comprising the steps of: 

(a) expressing a polypeptide having a sorting signal at its carboxy- 
terminal end, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 
substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 

30 substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 
and X4 is selected from the group consisting of alanine, serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed 

35 polypeptide; (ii) the substantially purified sortase-transamidase enzyme of claim 33; 
and (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide; and 
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(c) allowing the sortase-transamidase to catalyze a reaction that 
cleaves the polypeptide within the LPX3X4G motif of the sorting signal and covalently 
cross-links the amino-terminal portion of the cleaved polypeptide to the 
peptidoglycan to display the polypeptide on the surface of the Gram-positive 
5 bacterium. 



69. A method for displaying a polypeptide on the surface of a Gram- 
positive bacteriimi comprising the steps of: 

(a) expressing a polypeptide having a sorting signal at its carboxy- 
10 terminal end, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 

substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domam, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 3 1-33 
15 from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 
and X4 is selected from the group consisting of alanine, serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed 
polypeptide; (ii) the substantially purified sortase-transamidase enzyme of claim 34; 
and (iii) a Gram^ositive bacterium having a peptidoglycan to which the sortase- 

20 transamidase can link the polypeptide; and 

(c) allowing the sortase-transamidase to catalyze a reaction that 
cleaves the polypeptide within the LPX3X4G motif of the sorting signal and covalently 
cross-links the amino-terminal portion of the cleaved polypeptide to the 
peptidoglycan to display the polypeptide on the surface of the Gram-positive 

25 bacterium. 



70. A method for displaying a polypeptide on the surface of a Gram- 
positive bacterium comprising the steps of: 

(a) expressing a polypeptide having a sorting signal at its carboxy- 

30 terminal end, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 

substantially hydrophobic domain of at least 31 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
substantially , hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 31-33 

35 from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 
and X4 is selected from the group consisting of alanine, serine, and threonine; 
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(b) forming a reaction mixture including: (i) the expressed 
polypeptide; (ii) the substantially purified sortase-transamidase enzyme of claim 35; 
and (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide; and 
5 (c) allowing the sortase-transamidase to catalyze a reaction that 

cleaves the polypeptide within the LPX3X4G motif of the sorting signal and covalently 
cross-links the amino-terminal portion of the cleaved polypeptide to the 
peptidoglycan to display the polypeptide on the surface of the Gram-positive 
bacterium, 

10 

7 1 . A method for displaying a polypeptide on the surface of a Gram- 
positive bacterium comprising the steps of: 

(a) cloning a nucleic acid segment encoding a chimeric protein into a 
Gram-positive bacterium to generate a cloned chimeric protein including therein a 

15 carboxyl-terminal sorting signal, the chimeric protein including the polypeptide to be 
displayed, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 
substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 

20 being arginine, the two positively charged residues being located at residues 3 1-33 
from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 
and X4 is selected from the group consisting of alanine, serine, and threonine; 

(b) grov\dng the bacterium into which the nucleic acid segment has 
been cloned to express the cloned chimeric protein to generate a chimeric protein 

25 including therein a carboxyl-terminal sorting signal; and 

(c) binding the polypeptide covalently to the cell wall by the enzymatic 
action of a sortase-transamidase expressed by the Gram-positive bacterium involving 
cleavage of the chimeric protein vwthin the LPX3X4G motif so that the polypeptide is 
displayed on the surface of the Gram-positive bacterium in such a way that the 

30 polypeptide is accessible to a ligand. 

72. A polypeptide displayed on the surface of a Gram-positive 
bacterium by covalent linkage of an amino-acid sequence of LPX3X4 derived from 
cleavage of an LPX3X4G motif, wherein X3 is any of the twenty naturally-occurring 

35 L-amino acids and X4 is selected from the group consisting of alanine, serine, and 
threonine, the polypeptide being displayed on the surface of the Gram-positive 
bacterixim in such a way that the polypeptide is accessible to a ligand. 
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73. A covalent complex comprising: 

(a) the polypeptide of claim 72; and 

(b) an antigen or hapten covalently cross-linked to the polypeptide. 

5 

74. The covalent complex of claim 73 wherein an antigen is covalently 
cross-linked to the polypeptide. 

75. The covalent complex of claim 73 wherein a hapten is covalently 
10 cross-linked to the peptide. 

76. A method for vaccination of an animal comprising the step of 
immuni2dng the animal with the displayed polypeptide of claim 72 to generate an 
immune response against the displayed polypeptide. 

. 15 

77. A method for vaccination of an animal comprising the step of 
immimizing the animal with the covalent complex of claim 73 to generate an immime 
response against the antigen or hapten of the covalent complex. 

20 78. A method for screening for expression of a cloned polypeptide 

comprising the steps of: 

(a) expressing a cloned polypeptide as a chimeric protein having a 
sorting signal at its carboxy-terminal end, the sorting signal having: (1) a motif of 
LPX3X4G therein; (2) a substantially hydrophobic domain of at least 31 amino acids 

25 carboxyl to the motif; and (3) a, charged tail region with at least two positively charged 
residues carboxyl to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 
located at residues 3 1-33 from the motif, wherein X3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 

30 serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed chimeric 
protein; the substantially purified sortase-transamidase enzyme of claim 1; and (iii) a 
Gram-positive bacterium having a peptidoglycan to which the sortase-transamidase 
can link the polypeptide through the sorting signal; 

35 (c) binding the chimeric protein covalently to the cell wall by the 

enzymatic action of a sortase-transamidase expressed by the Gram-positive 
bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 
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that the polypeptide is displayed on the surface of the Gram-positive bacterium in 
such a way that the polypeptide is accessible to a ligand; and 

(d) reacting the displayed polypeptide with a labeled specific binding 
partner to screen the chimeric protein for reactivity with the labeled specific binding 
5 partner. 

79. A method for screening for expression of a cloned polypeptide 
comprising the steps of: 

(a) expressing a cloned polypeptide as a chimeric protein having a 
10 sorting signal at its carboxy-terminal end, the sorting signal having: (1) a motif of 

LPX3X4G therein; (2) a substantially hydrophobic domain of at least 3 1 amino acids 
caiboxyl to the motif; and (3) a charged tail region with at least two positively charged 
residues carboxyl to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 
15 located at residues 31-33 from the motif, wherein X3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 
serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed chimeric 
protein; (ii) the substantially purified sortase-transamidase enzyme of claim 3; and 

20 (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide through the sorting signal; 

(c) binding the chimeric protein covalently to the cell wall by the 
enzymatic action of a sortase-transamidase expressed by the Gram-positive 
bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 

25 that the polypeptide is displayed on the surface of the Gram-positive bacterium in 
such a way that the polypeptide is accessible to a ligand; and 

(d) reacting the displayed polypeptide with a labeled specific binding 
partner to screen the chimeric protein for reactivity with the labeled specific binding 
partner. 

30 

80. A method for screening for expression of a cloned polypeptide 
comprising the steps of: 

(a) expressing a cloned polypeptide as a chimeric protein having a 
sorting signal at its carboxy-terminal end, the sorting signal having: (1) a motif of 
35 LPX3X4G therein; (2) a substantially hydrophobic domain of at least 3 1 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
residues carboxyl to the substantially hydrophobic domain, at least one of the two 
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positively charged residues being arginine, the two positively charged residues being 
located at residues 3 1-33 from the motif, wherein X3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 
serine, and threonine; 

5 (b) forming a reaction mixture including: (i) the expressed chimeric 

protein; (ii) the substantially purified sortase-transamidase enzyme of claim 31 ; and 
(iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide through the sorting signal; 

(c) binding the chimeric protein covalently to the cell wall by the 
10 enzymatic action of a sortase-transamidase expressed by the Gram-positive 

bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 
that the polypeptide is displayed on the surface of the Gram-positive bacterium in 
such a way that the polypeptide is accessible to a ligand; and 

(d) reacting the displayed polypeptide with a labeled specific binding 
15 partner to screen the chimeric .protein for reactivity with the labeled specific binding 

partner. 

81 . A method for screening for expression of a cloned polypeptide 
comprising the steps of: 

20 (a) expressing a cloned polypeptide as a chimeric protein having a 

sorting signal at its carboxy-terminal end, the sorting signal having: (1) a motif of 
LPX3X4G therein; (2) a substantially hydrophobic domain of at least 31 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
residues carboxyl to the substantially hydrophobic domain, at least one of the two 

25 positively charged residues being arginine, the two positively charged residues being 
located at residues 31-33 from the motif, wherein X3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 
serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed chimeric 
30 protein; (ii) the substantially purified sortase-transamidase enzyme of claim 32; and 

(iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide through the sorting signal; 

(c) binding the chimeric protein covalently to the cell wall by the 
enzymatic action of a sortase-transamidase expressed by the Gram-positive 

35 bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 
that the polypeptide is displayed on the surface of the Gram-positive bacterium in 
such a way that the polypeptide is accessible to a ligand; and 
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(d) reacting the displayed polypeptide with a labeled specific binding 
partner to screen the chimeric protein for reactivity with the labeled specific binding 
partner. 

5 82. A method for screening for expression of a cloned polypeptide 

comprising the steps of: 

(a) expressing a cloned polypeptide as a chimeric protein having a 
sorting signal at its carboxy-terminal end, the sorting signal having: (1) a motif of 
LPX3X4G therein; (2) a substantially hydrophobic domain of at least 3 1 amino acids 

10 carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
residues carboxyl to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 
located at residues 31-33 from the motif, wherein X3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 

1 5 serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed chimeric 
protein; (ii) the substantially purified sortase-transamidase enzyme of claim 33; and 
(iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide through the sorting signal; 

20 (c) binding the chimeric protein covalentiy to the cell wall by the 

enzymatic action of a sortase-transamidase expressed by the Gram-positive 
bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 
that the polypeptide is displayed on the surface of the Gram-positive bacterium in 
such a way that the polypeptide is accessible to a ligand; and 

25 (d) reacting the displayed polypeptide vnth a labeled specific binding 

partner to screen the chimeric protein for reactivity with the labeled specific binding 
partner. 

83. A method for screening for expression of a cloned polypeptide 
30 comprising the steps of: 

(a) expressing a cloned polypeptide as a chimeric protein having a 
sorting signal at its carboxy-terminal end, the sorting signal having: (1) a motif of 
LPX3X4G dierein; (2) a substantially hydrophobic domain of at least 31 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
35 residues carboxyl to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 
located at residues 31-33 from the motif, wherein X3 is any of the twenty naturally- 
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occurring L-amino acids arid X4 is selected from the group consisting of alanine, 
serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed chimeric 
protein; (ii) the substantially purified sortase-transamidase enzyme of claim 34; and 

5 (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide through the sorting signal; 

(c) binding the chimeric protein covalently to the cell wall by the 
enzymatic action of a sortase-transamidase expressed by the Gram-positive 
bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 

10 that the polypeptide is displayed on the sxirface of the Gram-positive bacterium in 
such a way that the polypeptide is accessible to a ligand; and 

(d) reacting the displayed polypeptide with a labeled specific binding 
partner to screen the chimeric protein for reactivity with the labeled specific binding 
partner. 

15 

84. A method for screening for expression of a cloned polypeptide 
comprising the steps of: 

(a) expressing a cloned polypeptide as a chimeric protein having a 
sorting signal at its carboxy-terminal end, the sorting signal having: (1) a motif of 

20 LPX3X4G therein; (2) a substantially hydrophobic domain of at least 3 1 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
residues carboxyl to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 
located at residues 31-33 from the motif, wherein X3 is any of the twenty naturally- 

25 occurring L-amino acids and X4 is selected from the group consisting of alanine, 
serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed chimeric 
protein; (ii) the substantially purified sortase-transamidase enzyme of claim 35; and 
(iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 

30 transamidase can link the polypeptide through the sorting signal; 

(c) binding the chimeric protein covalently to the cell wall by the 
enzymatic action of a sortase-transamidase expressed by the Gram-positive 
bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 
that the polypeptide is displayed on the surface of the Gram-positive bacterium in 

35 such a way that the polypeptide is accessible to a ligand; and 
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(d) reacting the displayed polypeptide with a labeled specific binding 
partner to screen the chimeric protein for reactivity with the labeled specific binding 
partner. 

5 85. A method for screening for expression of a cloned polypeptide 

comprising the steps of: 

(a) cloning a nucleic acid segment encoding a chimeric protein into a 
Gram-positive bacterium to generate a cloned chimeric protein including therein a 
carboxyl-terminal sorting signal, the chimeric protein including the polypeptide 

10 whose expression is to be screened, the sorting signal having: (1) a motif of LPX3X4G 
therein; (2) a substantially hydrophobic domain of at least 3 1 amino acids carboxyl to 
the motif; and (3) a charged tail region with at least two positively charged residues 
carboxyl to the substantially hydrophobic domain, at least one of the two positively 
charged residues being arginine, the two positively charged residues being located at 

15 residues 3 1-33 from the motif, wherein X3 is any of the twenty naturally-occurring 
L-amino acids and X4 is selected from the group consisting of alanine, serine, and 
threonine; 

(b) growing the bacterium into which the nucleic acid segment has 
been cloned to express the cloned chimeric protein to generate a chimeric protein 

20 including therein a carboxyl-terminal sorting signal; 

(c) binding the polypeptide covalently to the cell wall by the enzymatic 
action of a sortase-transamidase expressed by the Gram-positive bacterium involving 
cleavage of the chimeric protein v^thin the LPX3X4G motif so that the polypeptide is 
displayed on the surface of the Gram-positive bacterium in such a way that the 

25 polypeptide is accessible to a ligand; and 

(d) reacting the displayed polypeptide with a labeled specific binding 
partner to screen the chimeric protein for reactivity with the labeled specific binding 
partner. 



30 86. A method for the diagnosis or treatment of a bacterial infection 

caused by a Gram-positive bacterium comprising the steps of: 

(a) conjugatmg an antibiotic or a detection reagent to a protein 
including therein a carboxyl-terminal sorting signal to produce a conjugate, the 
carboxyl-terminal sorting signal having: (1) a motif of LPX3X4G therein; (2) a 

35 substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 



wo 99/09145 



PCT/US98/16229 



-62- 

being arginine, the two positively charged residues being located at residues 3 1-33 
from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 
and X4 is selected from the group consisting of alanine, serine, and threonine; and 
(b) introducing the conjugate to an organism infected with a Gram- 
5 positive bacterium in order to cause the conjugate to be sorted and covalently cross- 
linked to the cell walls of the bacterium in order to treat or diagnose the infection. 



87. The method of claim 86 wherein an antibiotic is conjugated to the 

protein. 

10 

88. The method of claim 87 wherein the antibiotic is selected from the 
group consisting of a penicillin, ampicillin, vancomycin, gentamicin, streptomycin, a 
cephalosporin, amikacin, kanamycin, neomycin, paromomycin, tobramycin, 
ciprofloxacin, clindamycin, rifampin, chloramphenicol, norfloxacin, and a derivative 

1 5 of these antibiotics. 



89. The method of claim 86 wherein a detection reagent is conjugated 

to the protein. 

20 90. A conjugate comprising an antibiotic or a detection reagent 

covalently conjugated to a protein includmg therein a carboxyl-terminal sorting signal 
to produce a conjugate, the carboxyl-terminal sorting signal having: (1) a motif of 
LPX3X4G therein; (2) a substantially hydrophobic domain of at least 31 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 

25 residues carboxyl to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 
located at residues 31-33 from the motif, wherein X3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 
serine, and threonine. 



30 



91 . The conjugate of claim 90 wherein an antibiotic is conjugated to 

the protein. 



35 



92. The conjugate of claim 91 wherein the antibiotic is selected from 
the group consisting of a penicillin, ampicillin, vancomycin, gentamicin, 
streptomycin, a cephalosporin, amikacin, kanamycin, neomycin, paromomycin. 
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tobramycin, ciprofloxacin, clindamycin, rifampin, chloramphenicol, norfloxacin, and 
a derivative of these antibiotics. 

93. The conjugate of claim 90 wherein a detection reagent is 
5 conjugated to the protein. 

94. A composition comprising: 

(a) the conjugate of claim 90; and 

(b) a pharmaceutically acceptable carrier. 

95. A substantially purified protein having at least about 50% match 
with best alignment, in at least one subunit of the protein, with the amino acid 
sequences of at least one of the putative Bacillus peptidase (SEQ ID NO: 3), tiie 
aminopeptidase P of Lactococcus lactis (SEQ ID NO: 4), or the proline dipeptidase of 
Lactobacillus delbrueckii lactis (SEQ ID NO: 5) and having sortase-transamidase 
activity. 



10 



15 



96. The substantially purified protein of claim 95 wherem the match 
with best alignment with the amino acid sequences of at least one of the putative 

20 Bacillus peptidase (SEQ ID NO: 3), the aminopeptidase P of Lactococcus lactis (SEQ 
ID NO: 4), or the proline dipeptidase of Lactobacillus delbrueckii lactis (SEQ ID NO: 
5) is at least about 60%. 

97. The substantially purified protein of claim 96 wherein the match 
25 witii best alignment with the amino acid sequences of at least one of the putative 

Bacillus peptidase (SEQ ID NO: 3), the aminopeptidase P of Lactococcus lactis (SEQ 
ID NO: 4), or the proline dipeptidase of Lactobacillus delbrueckii lactis (SEQ ID NO: 
5) is at least about 70%. 



30 98. A substantially purified protein having sortase-transamidase 

activity and a hydrophobicity profile of at least one subunit of the protein, that, 
determined as the mean absolute value of the hydrophobicity difference per residue^ 
differs fi"om the hydrophobicity profile of a putative Bacillus peptidase (SEQ ID NO: 
3) by no more than about 2 units on the hydrophobicity scale. 



35 
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99. The substantially purified protein of claim 98 wherein the 
hydrophobicity profile differs from the hydrophobicity profile of the putative fiad/Ziw 
peptidase (SEQ ID NO: 3) by no more than about 1 imit. 

5 1 00. The substantially pxuified protein of claim 99 wherein the 

hydrophobicity profile differs from the hydrophobicity profile of the putative 5flcf//uy 
peptidase (SEQ ID NO: 3) by no more than about 0.5 imit. 

101 . A nucleic acid sequence encoding the substantially purified 
1 0 protein of claim 95 . 

102. A nucleic acid sequence encoding the substantially purified 
protein of claim 98. 

15 103. A vector comprising the nucleic acid sequence of claim 101 

operatively linked to at least one control sequence that controls the expression or 
regulation of the nucleic acid sequence. 

104. A vector comprising the nucleic acid sequence of claim 102 
20 operatively linked to at least one control sequence that controls the expression or 
regulation of the nucleic acid sequence. 



1 05 . A host cell transfected with the vector of claim 1 03 . 
25 1 06. A host cell transfected with the vector of claim 1 04. 



107. A method for producing a substantially purified protein having 
sortase-transamidase activity comprising the steps of: 

(a) culturing the host cell of claim 105 under conditions in which the 
30 host cell expresses the protein having sortase-transamidase activity; and 

(b) purifying the expressed protein to produce substantially purified 
protein having sortase-transamidase activity. 

108. A method for producing a substantially purified protein having 
35 sortase-transamidase activity comprising the steps of: 

(a) culturing the host cell of claim 106 under conditions m which the 
host cell expresses the protein having sortase-transamidase activity; and 
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(b) purifying the expressed protein to produce substantially purified 
protein having sortase-transamidase activity. 



109. A substantially purified protein having sortase-transamidase 
5 activity and a hydrophobicity profile of at least one subunit of the protein, that, 
determined as the mean absolute value of the hydrophobicity difference per residue, 
differs fi-om the hydrophobicity profile of the sequence of SEQ ID NO: 3 1 by no more 
than about 2 units on the hydrophobicity scale. 

10 110. The substantially purified protein of claim 1 09 wherem the 

hydrophobicity profile differs fi-om the hydrophobicity profile of the sequence of SEQ 
ID NO: 3 1 by no more than about 1 unit. 



111. The substantially purified protein of claim 1 10 wherein the 

1 5 hydrophobicity profile differs fi-om the hydrophobicity profile of the sequence of SEQ 
ID NO: 3 1 by no more than about 0.5 imit. 

112. A nucleic acid sequence encoding the substantially purified 
protein of claim 1 09. 

20 

1 13. A vector comprising the nucleic acid sequence of claim 1 12 
operatively linked to at least one control sequence that controls the expression or 
regulation of the nucleic acid sequence. 



25 1 14. A host cell transfected with the vector of claim 113. 

115. A method for producing a substantially purified protein having 
sortase-transamidase activity comprising the steps of: 

(a) culturing the host cell of claim 1 14 under conditions in which the 
30 host cell expresses the protein having sortase-transamidase activity; and 

(b) purifying the expressed protein to produce substantially purified 
protein having sortase-transamidase activity. 
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1 c 
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SEQUENCE LISTING 

<110> Schneewind, Olaf 
5 Ton-That, Hung 

Mazmanian, Sarkis 

<120> IDENTIFICATION OF SORTASE GENE 

10 

<130> 30435.46WO01 

<150> 60/055,662 

<151> 1997-08-14 

15 

<160> 34 

<170> FastSEQ for Windows Version 3.0 

20 <210> 1 

<211> 5 
<212> PRT 

<213> Artificial Sequence 

25 <220> 

<223> Mutated derived from Streptococcus Pyogenes 

<400> 1 





LteU <r£^o Aoa 


Thr Gly 




















30 


1 




5" 






















<210> 


2 
























<211> 


121 
























<212> 


PRT 






















35 


<213> 


Staphylococcus Aureus 














<400> 


2 
























Asp Pro Lys 


Leu 


Lys 


Glu 


He 


Tyr 


Gin 


He 


Val 


Leu 


Glu 


Ser Gin Met 




1 




5 










10 








15 


40 


Lys Ala lie 


Asn 


Glu 


He 


Arg 


Pro 


Gly 


Met 


Thr 


Gly 


Ala 


Glu Ala Asp 






20 










25 










30 




Ala lie Ser 


Arg 


Asn 


Tyr 


Leu 


Glu 


Ser 


Lys 


Gly 


Tyr 


Gly 


Lys Glu Phe 




35 










40 










45 






Gly His Ser 


Leu 


Gly 


His 


Gly 


He 


Gly 


Leu 


Glu 


He 


His 


Glu Gly Pro 


45 


50 








55 










60 






Met Leu Ala 


Arg 


Thr 


He 


Gin 


Asp 


Lys 


Leu 


Gin 


Val 


Asn 


Asn Cys Val 




65 






70 










75 






60 




Thr Val Glu 


Pro 


Gly 


Val 


Tyr 


He 


Glu 


Gly 


Leu 


Gly 


Gly 


He Arg He 








85 










90 








95 


50 


Glu Asp Asp 


He 


Leu 


He 


Thr 


Glu 


Asn 


Gly 


Cys 


Gin 


Val 


Phe Thr Lys 






100 










105 










110 




Cys Thr Lys 


Asp 


Leu 


He 


Val 


Leu 


Thr 














115 










120 














55 


<210> 


3 
























<211> 


353 
























<212> 


PRT 
























<213> 


Bacillus Sp. 


















60 


<400> 


3 
























Met Lys Leu 


Glu 


Lys 


Leu Arg 


Asn 


Leu 


Phe 


Gly 


Gin 


Leu 


Gly He Asp 




1 




5 










10 








15 




Gly Met Leu 


He 


Thr 


Ser 


Asn 


Thr 


Asn 


Val 


Arg 


Val 


Met 


Thr Gly Phe 




20 










25 










30 


65 


Thr Gly Ser 


Ala Gly Leu Ala 


Val 


He 


Ser 


Gly 


Asp 


Lys 


Ala Ala Phe 




35 










40 










45 






lie Thr Asp 


Phe Arg Tyr Thr 


Glu 


Gin 


Ala 


Lys 


Val 


Gin 


Val Lys Gly 




50 








55 










60 








Phe Glu He 


He 


Glu 


His 


Gly 


Gly 


Ser 


Leu 


He 


Gin 


Thr 


Thr Ala Asp 
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65 70 75 80 

Thr Val Glu Ser Phe Gly lie Lys Arg Leu Gly Phe Glu Gin Asn Ser 

85 90 95 

Met Thr Tyr Gly Thr Tyr Ala Ser Tyr Ser Ala Val He Ser Asp Ala 
5 100 105 110 

Glu Leu Val Pro Val Ala Glu Ser Val Glu Lys Leu Arg Leu He Lys 

115 120 125 

Ser Ser Glu Glu He Lys He Leu Glu Glu Ala Ala Lys He Ala Asp 
130 135 140 

10 Asp Ala Phe Arg His He Leu Thr Phe Met Lys Pro Gly He Ser Glu 
145 150 155 160 

He Ala Val Ala Asn Glu Leu Glu Phe Tyr Met Arg Ser Gin Gly Ala 

165 170 175 

Asp Ser Ser Ser Phe Asp Met He Val Ala Ser Gly Leu Arg Ser Ser 
15 180 185 190 

Leu Pro His Gly Val Ala Ser Asp Lys Leu He Glu Ser Gly Asp Leu 

195 200 205 

Val Thr Leu Asp Phe Gly Ala Tyr Tyr Lys Gly Tyr Cys Ser Asp He 
210 215 220 

20 Thr Arg Thr Val Ala Val Gly Gin Pro Ser Asp Gin Leu Lys Glu He 
225 230 235 240 

Tyr Gin Val Val Phe Asp Ala Gin Ala Leu Gly Val Ala His He Lys 

245 250 255 

Pro Gly Met Thr Gly Lys Glu Ala Asp Ala Leu Thr Arg Asp His He 
25 260 265 270 

Ala Ala Lys Gly Tyr Gly Asp Tyr Phe Gly His Ser Thr Gly His Gly 

275 280 285 

Leu Gly Met Glu Val His Glu Ser Pro Gly Leu Ser Val Arg Ser Ser 
290 295 300 

30 Ala He Leu Glu Pro Gly Met Val Val Thr Val Glu Pro Gly He Tyr 
305 310 315 320 

He Pro Glu Thr Gly Gly Val Arg He Glu Asp Asp He Val He Thr 

325 330 335 

Glu Asn Gly Asn Arg Thr He Thr His Ser Pro Lys Glu Leu He He 
35 340 345 350 

Leu 



<210> 4 
40 <211> 352 

<212> PRT 

<213> Lactococcus Lactis 
<400> 4 

45 Met Arg He Glu Lys Leu Lys Val Lys Met Leu Thr Glu Asn He Asp 
15 10 15 

Ser Leu Leu He Thr Asp Met Lys Asn He Phe Tyr Leu Thr Gly Phe 

20 25 30 

Ser Gly Thr Ala Gly Thr Val Phe Leu Thr Gin Lys Arg Asn He Phe 
50 35 40 45 

Met Thr Asp Ser Arg Tyr Ser Glu Met Ala Arg Gly Leu He Lys Asn 

50 55 60 

Phe Glu He He Glu Thr Arg Asp Pro He Ser Leu Leu Thr Glu Leu 
65 70 75 80 

55 Ser Ala Ser Glu Ser Val Lys Asn Met Ala Phe Glu Glu Thr Val Asp 

85 90 95 

Tyr Ala Phe Phe Lys Arg Leu Ser Lys Ala Ala Thr Lys Leu Asp Leu 

100 105 110 

Phe Ser Thr Ser Asn Phe Val Leu Glu Leu Arg Gin He Lys Asp Glu 
60 115 120 125 

Ser Glu He Ser Leu He Lys Lys Ala Cys Glu He Ala Asp Glu Ala 

130 135 140 

Phe Met Ser Ala Leu Arg Phe He Glu Pro Gly Arg Thr Glu He Glu 
145 150 155 160 

65 Val Ala Asn Phe Leu Asp Phe Lys Met Arg Asp Leu Glu Ala Ser Gly 

165 170 175 

He Ser Phe Glu Thr He Val Ala Ser Gly Lys Arg Ser Ser Leu Pro 

180 185 190 

His Gly Val Ala Thr Ser Lys Met He Gin Phe Gly Asp Pro Val Thr 
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195 


200 






205 




lie Asp Phe Gly Cys 


Tyr Tyr Glu 


His 


Tyr 


Ala Ser Asp 


Met Thr Arg 




215 






220 


'PK** T1a CVa IVaI 

Tnr lie rne vai Giy 


Ser Val Asp Asp 


Lys 


Met Arg Thr 


He Tyr Glu 




230 






235 


240 


Tnr val Arg Lys Ala 


Asn Glu Ala 


Leu 


He 


Lys Gin Val 


Lys Ala Gly 


245 






250 




255 


Ma^ TK** T«»v> &1a /^1t\ 

new inr lyr Axa uin 


Tyr Asp Asn 


He 


Pro 


Arg Glu Val 


He Glu Lys 


260 




265 






270 


Ala Asp Phe Gly Gin 


Tyr Phe Thr 


His 


Gly 


He Gly His 


Gly Leu Gly 


275 


280 






285 


Leu Asp Val His Glu 


He Pro Tyr 


Phe 


Asn 


Gin Ser Met 


Thr Glu Asn 


290 


295 






300 




Gin Leu Arg Ser Gly 


Met Val He 


Thr 


Asp 


Glu Pro Gly 


He Tyr Leu 


305 . 


310 






315 


320 


Pro Glu Phe Gly Gly 


Val Arg He Glu 


Asp 


Asp Leu Leu 


Val Thr Glu 


325 






330 


335 


Asn Gly Cys Glu Val 


Leu Thr Lys 


Ala 


Pro 


Lys Glu Leu 


He Val He 


340 




345 






350 



20 

<210> 5 

<211> 368 

<212> PRT 

<213> Lactobacillus Oelbrueckii Lactis 

25 

<400> 5 





Met Asn 


Leu Asp 


Lys 


Leu 


Gin 


Asn 


Trp 


Leu 


Gin 


Glu Asn Gly Met Asp 




1 • 




5 










10 




15 




Val Ala 


Tyr Val 


Ser 


Ser 


Pro 


Thr 


Thr 


He 


Asn 


Tyr Phe Thr Gly Phe 


30 




20 










25 






30 




He Thr 


Asp Pro 


Glu 


Glu 


Arg 


He 


Phe 


Lys 


Leu 


Phe Ala Phe Lys Asp 






35 








40 








45 




Ala Glu 


Pro Phe 


Leu 


Phe 


Cys 


Pro 


Ala 


Leu 


Asn 


Tyr Glu Glu Ala Lys 




50 








55 










60 


35 


Ala Ser 


Ala Trp 


Asp 


Gly 


Asp 


Val 


Val 


Gly 


Tyr 


Leu Asp Ser Glu Asp 




65 






70 










75 


80 




Pro Trp 


Ser Lys 


He 


Ala 


Glu 


Glu 


He 


Lys 


Lys 


Arg Thr Lys Asp Tyr 








85 










90 




95 




Gin Asn 


Trp Ala 


Val 


Glu 


Lys 


Asn 


Gly 


Leu 


Thr 


Val Ala His Tyr Gin 


40 




100 










105 






110 




Ala Leu 


His Ala 


Gin 


Phe 


Pro 


Asp 


Ser 


Asp 


Phe 


Ser Lys Asp Leu Ser 






115 








120 








125 




Asp Phe 


He Ala 


His 


He 


Arg 


Leu 


Phe 


Lys 


Thr 


Glu Ser Glu Leu Val 




130 








135 










140 


45 


Lys Leu 


Arg Lys 


Ala 


Gly 


Glu 


Glu 


Ala 


Asp 


Phe 


Ala Phe Gin He Gly 




145 






150 










155 


160 




Phe Glu 


Ala Leu 


Arg 


Asn 


Gly 


Val 


Thr 


Glu 


Arg 


Ala Val Val Ser Gin 








165 










170 


175 




He Glu 


Tyr Gin 


Leu 


Lys 


Leu 


Gin 


Lys 


Gly 


Val 


Met Gin Thr Ser Phe 


50 




180 










185 






190 




Asp Thr 


He Val 


Gin 


Ala 


Gly 


Lys 


Asn 


Ala 


Ala 


Asn Pro His Gin Gly 






195 








200 








205 




Pro Ser 


Met Asn 


Thr 


Val 


Gin 


Pro 


Asn 


Glu 


Leu 


Val Leu Phe Asp Leu 




210 








215 










220 


55 


Gly Thr 


Met His 


Glu 


Gly 


Tyr 


Ala 


Ser 


Asp 


Ser 


Ser Arg Thr Val Ala 




225 






230 










235 


240 




Tyr Gly 


Glu Pro 


Thr 


Asp 


Lys 


Met 


Arg 


Glu 


He 


Tyr Glu Val Asn Arg 








245 










250 




255 




Thr Ala 


Gin Gin 


Ala 


Ala 


He 


Asp 


Ala 


Ala 


Lys 


Pro Gly Met Thr Ala 


60 




260 










265 






270 




Ser Glu 


Leu Asp 


Gly 


Val 


Ala 


Arg 


Lys 


He 


He 


Thr Asp Ala Gly Tyr 






275 








280 








285 




Gly Glu 


Tyr Phe 


He 


His 


Arg 


Leu 


Gly 


His 


Gly 


He Gly Met Glu Val 




290 






295 










300 


65 


His Glu 


Phe Pro 


Ser 


He 


Ala 


Asn 


Gly 


Asn 


Asp 


Val Val Leu Glu Glu 




305 






310 










315 


320 




Gly Met 


Cys Phe 


Ser 


He 


Glu 


Pro 


Gly 


He 


Tyr 


He Pro Gly Phe Ala 




325 










330 




335 




Gly Val 


Arg He 


Glu 


Asp 


Cys 


Gly 


Val 


Leu 


Thr 


Lys Asp Gly Phe Lys 
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340 345 350 

Pro Phe Thr His Thr Ser Lys Glu Leu Lys Val Leu Pro Val Lys Glu 
355 360 365 

5 <210> 6 

<211> 25 
<212> PRT 

<213> Staphylococcus Aureus 

10 <400> 6 

Glu Glu Asn Pro Phe lie Gly Thr Thr Val Phe Gly Gly Leu Ser Leu 

1 5 10 15 

Ala Leu Gly Ala Ala Leu Leu Ala Gly 
20 25 



15 



20 



<210> 7 
<211> 23 
<212> PRT 

<213> Staphylococcus Aureus 



<400> 7 

Gly Glu Glu Ser Thr Asn Lys Gly Met Leu Phe Gly Gly Leu Phe Ser 

1 5 10 15 

He Leu Gly Leu Ala Leu Leu 
25 20 

<210> 8 
<211> 24 
<212> PRT 

30 <213> Staphylococcus Sobrinos 

<400> 8 

Asp Ser Ser Asn Ala Tyr Leu Pro Leu Leu Gly Leu Val Ser Leu Thr 
15 10 15 

35 Ala Gly Phe Ser Leu Leu Gly Leu 
20 

<210> 9 
<211> 24 
40 <212> PRT 

<213> Enterococcus Faecalis 

<400> 9 

Glu Lys Gin Asn Val Leu Leu Thr Val Val Gly Ser Leu Ala Ala Met 
45 1 5 10 15 

Leu Gly Leu Ala Gly Leu Gly Phe 
20 

<210> 10 
50 <211> 23 

<212> PRT 

<213> Streptococcus Pyogenes 
<400> 10 

55 Ser He Gly Thr Tyr Leu Phe Lys He Gly Ser Ala Ala Met He Gly 
15 10 15 

Ala He Gly He Tyr He Val 
20 

60 <210> 11 

<211> 22 
<212> PRT 

<213> Listeria Monocytogenes 

65 <400> 11 

Asp Ser Asp Asn Ala Leu Tyr Leu Leu Leu Gly Leu Leu Ala Val Gly 

15 10 15 

Thr Ala Met Ala Leu Thr 
20 
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10 



15 



<210> 12 

<211> 5 

<212> PRT 

<213> Staphylococcus Aureus 

<400> 12 
Arg Arg Arg Glu Leu 
1 5 

<210> 13 
<211> 9 
<212> PRT 

<213> Staphylococcus Aureus 
<400> 13 

Arg Arg Asn Lys Lys Asn His Lys Ala 
1 5 



20 <210> 14 

<211> 5 

<212> PRT 

<213> Staphylococcus Sobrinus 

25 <400> 14 

Arg Arg Lys Gin Asp 
1 5 



<210> 15 
30 <211> 7 

<212> PRT 

<213> Enterococcus Faecalis 
<400> 15 

35 Lys Arg Arg Lys Glu Thr Lys 
1 5 



<210> 16 

<211> 5 

40 <212> PRT 

<213> Streptococcus Pyogenes 

<400> 16 

Lys Arg Arg Lys Ala 
45 1 5 



<210> 17 
<211> 8 
<212> PRT 

50 <213> Actinomyces Viscosus 



<400> 17 
Lys Arg Arg His Val Ala Lys His 
1 5 

<210> 18 
<211> 5 
<212> PRT 

<213> Streptococcus Aglactiae 

<400> 18 
Lys Arg Arg Lys Ser 
1 5 



65 <210> 19 

<211> 6 

<212> PRT 

<213> Streptococcus Pyogenes 
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<400> 19 
Lys Arg Lys Glu Glu Asn 
1 5 



5 <210> 20 

<211> 5 
<212> PRT 

<213> Artificial Sequence 

10 <220> 

<223> Mutated derived from Streptococcus Pyogenes 

<400> 20 
Arg Arg Arg Glu Ser 
15 1 5 



20 



<210> 21 
<211> 5 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Mutated derived from Streptococcus Pyogenes 

25 <400> 21 

Arg Arg Arg Ser Leu 
1 5 



<210> 22 
30 <211> 5 

<212> PRT 

<213> Artificial Sequence 



<220> 

35 <223> Mutated derived from Streptococcus Pyogenes 



40 



45 



<400> 22 
Arg Arg Ser Glu Leu 
1 5 

<210> 23 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutated derived from Streptococcus Pyogenes 



<400> 23 

50 Arg Ser Arg Glu Leu 
1 5 

<210> 24 

<211> 5 

55 <212> PRT 

<213> Artificial Sequence 



<220> 

<223> Mutated derived from Streptococcus Pyogenes 

60 

<400> 24 
Ser Arg Arg Glu Leu 
1 5 



65 <210> 25 

<211> 5 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Mutated derived from Streptococcus Pyogenes 

<400> 25 
Arg Arg Ser Ser Ser 
1 5 



10 



15 



20 



25 



30 



<210> 26 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutated derived from Streptococcus Pyogenes 

<400> 26 
Arg Ser Arg Ser Ser 
1 5 

<210> 27 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutated derived from Streptococcus Pyogenes 

<400> 27 
Ser Arg Arg Ser Ser 
1 5 



35 



<210> 28 
<2X1> 1017 
<212> DNA 

<213> Staphylococcus Aureus 



40 



45 



50 



55 



<220> 
<221> CDS 
<222> (716) ., 

<400> 28 



. (1014) 



cgactctaga 
tactattatt 
tgaggggcat 
aataagcatt 
tttaattcaa 
aaaagcnngc 
aggagcagat 
accatggtgt 
ggcgcnttnt 
aacccgatcc 
tggtgttgca 
gtattataac 



ggacaagcaa 
ggtgagatta 
catgtnnttt 
tctaatactg 
aaagcagcta 
atgactgaaa 
ggaccntctt 
tgcaagtgat 
tataaccggc 
ntaaactgaa 
agtgataaaa 
ggctattgtt 



ctaagcaggc 
aaaaattgct 
atgatacata 
tagataaaat 
atattgttga 
aanaattaaa 
tcgatacgat 
aaaattattg 
tattgttcaa 
aagaaattgt 
ttattgaaaa 
cagatattac 



gccaaattat 
acnccaanaa 
ccttgaatta 
tagagacgtc 
tgaaacatat 
ggcaatatta 
tgtngcatct 
aaaaaggcga 
attttactan 
agcatctgtt 
aggcgacatg 
tagaacattt 



gaaattatta 
aattttgaaa 
aataaaagcc 
caagatgctg 
gaatatattt 
gaaagccaaa 
ggtcctagag 
catgattaca 
aacattttgc 
catagaggtg 
atnacattag 
gctattggag 



atcgtaaatc 
atgttngttt 
gtatatcatt 
acgaaattgc 
taactgttgt 
tgctanaatt 
gtgcattacc 
ttanattttg 
nattgggaaa 
cattaccaca 
atttnggcgc 
aacca gat 
Asp 
1 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
718 



60 



65 



cct aaa ctg aaa gaa ata tat caa ata gta ctt gaa tct caa atg aaa 766 

Pro Lys Leu Lys Glu lie Tyr Gin lie Val Leu Glu Ser Gin Met Lys 

5 10 15 

gca att aat gag att aga cct ggc atg act ggt gca gaa get gat gcc 814 
Ala He Asn Glu He Arg Pro Gly Met Thr Gly Ala Glu Ala Asp Ala 
20 25 30 

att tea aga aac tat tta gag tea aaa ggg tat gga aaa gaa ttt gga 862 

He Ser Arg Asn Tyr Leu Glu Ser Lys Gly Tyr Gly Lys Glu Phe Gly 
35 40 45 

eat tea eta gga cat ggt att ggt tta gaa ate eat gaa ggg eea atg 910 
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His 


Ser Leu Gly 


His 


Gly 


He Gly Leu Glu He His Glu Gly Pro Met 




50 








55 


60 


65 




ctg 


gnt cgt 


acg 


ata 


caa 


gat aaa ett eaa gtt aae aac tgt 


gtt aca 


5 


Leu 


Xaa Arg 


Thr 


He 


Gin Asp Lys Leu Gin Val Asn Asn Cys 


Val Thr 










f U 




75 


80 






gna cct 


ggt 


gtt 


nat 


ata gaa ggt ttg ggc ggt ata aga ata gaa 




Val 


Xaa Pro Gly 


Val 


Xaa 


He Glu Gly Leu Gly Gly He Arg He Glu 


10 






85 






90 95 






gag 


ata tt taa 












Glu 


He 












1 c 
13 




















<210> 


29 














<211> 


100 














<212> 


PRT 














<213> 


Staphylococcus Aureus 








<400> 


29 












Asp 


Pro Lys 


Leu 


Lys 


Glu 


He Tyr Gin He Val Leu Glu Ser 


Gin Met 




1 






5 




10 


15 


25 


Lys 


Ala He 


Asn 


Glu 


He 


Arg Pro Gly Met Thr Gly Ala Glu Ala Asp 








20 






25 30 






Ala 


He Ser 


Arg 


Asn Tyr 


Leu Glu Ser Lys Gly Tyr Gly Lys Glu Phe 






35 








40 45 






Gly His Ser 


Leu 


Gly 


His 


Gly He Gly Leu Glu He His Glu Gly Pro 


30 




50 








55 60 






Met 


Leu Xaa 


Arg 


Thr 


He 


Gin Asp Lys Leu Gin Val Asn Asn 


Cys Val 




65 








70 


75 


80 




Thr 


Val Xaa 


Pro 


Gly Val 


Xaa He Glu Gly Leu Gly Gly He Arg He 










85 




90 


95 


35 


Glu 


Glu He 


Phe 
















100 














<210> 


30 














<211> 


360 










40 




<212> 


DNA 














<213> 


Staphlococcus Aureus 








<220> 
















<221> 


CDS 










45 




<222> 


(1) . 


... (357) 










<400> 


30 












atg 


gtc aaa 


gta 


act 


gat 


tat tea aat tea aaa tta ggt aaa 


gta gaa 




Met 


Val Lys 


Val 


Thr 


Asp 


Tyr Ser Asn Ser Lys Leu Gly Lys 


Val Glu 




1 






5 




10 


15 




ata 


gcg cca 


gaa 


gtg 


eta 


tct gtt att gea agt ata get act 


teg gaa 




He 


Ala Pro 


Glu 


Val 


Leu 


Ser Val He Ala Ser He Ala Thr 


Ser Glu 








20 






25 30 




55 


















gtc 


gaa ggc 


ate 


act 


ggc 


cat ttt get gaa tta aaa gaa aca 


aat tta 




Val 


Glu Gly 


He 


Thr 


Gly 


His Phe Ala Glu Leu Lys Glu Thr 


Asn Leu 






35 








40 45 




60 


gaa 


aaa gtt 


agt 


cgt 


aaa 


aat tta age cgt gat tta aaa ate 


gag agt 




Glu 


Lys Val 


Ser 


Arg 


Lys 


Asn Leu Ser Arg Asp Leu Lys He 


Glu Ser 






50 








55 60 





958 



1006 



1017 



48 



96 



144 



192 



aaa gaa gat ggc ata tat ata gat gta tat tgt gea tta aaa cat ggt 240 

65 Lys Glu Asp Gly He Tyr He Asp Val Tyr Cys Ala Leu Lys His Gly 

65 70 75 80 

aat att tea aaa act gea aac aaa att caa acg tea att ttt aat tea 288 

Asn He Ser Lys Thr Ala Asn Lys He Gin Thr Ser He Phe Asn Ser 
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85 90 95 

att tct aat atg aca gcg ata gaa cct aag caa att aat att cac att 336 
lie Ser Asn Met Thr Ala lie Glu Pro Lys Gin He Asn He His He 
5 100 105 HO 

aca caa ate gtt att gaa aag taa 360 
Thr Gin He Val He Glu Lys 
115 

10 

<210> 31 
<211> 119 
<212> PRT 

15 <213> Staphylococcus Aureus 



<400> 31 





Met 


Val 


Lys 


Val 


Thr 


Asp 


Tyr 


Ser 


Asn 


Ser Lys 


Leu Gly Lys Val 


Glu 




1 








5 










10 




15 




20 


He 


Ala 


Pro 


Glu 


Val 


Leu 


Ser 


Val 


He 


Ala Ser 


He 


Ala Thr Ser 


Glu 










20 










25 






30 






Val 


Glu 


Gly 


He 


Thr 


Gly 


His 


Phe 


Ala 


Glu Leu 


Lys 


Glu Thr Asn 


Leu 








35 










40 








45 






Glu 


Lys 


Val 


Ser 


Arg 


Lys 


Asn 


Leu 


Ser Arg Asp 


Leu 


Lys He Glu 


Ser 


25 




50 










55 








60 








Lys 


Glu 


Asp 


Gly 


He 


Tyr 


He 


Asp 


Val 


Tyr Cys 


Ala 


Leu Lys His Gly 




65 










70 








75 






80 




Asn 


He 


Ser 


Lys 


Thr 


Ala 


Asn 


Lys 


He 


Gin Thr 


Ser 


He Phe Asn 


Ser 










85 










90 




95 




30 


He 


Ser 


Asn 


Met 


Thr 


Ala 


He 


Glu 


Pro 


Lys Gin 


He 


Asn He His 


He 










100 










105 






110 






Thr 


Gin 


He 


Val 


He 


Glu 


Lys 















115 



35 <210> 32 

<2H> 19 
<212> PRT 

<213> Artificial Sequence 

40 <220> 

<223> Mutated derived from Streptococcus Pyogenes 

<400> 32 

His His His His His His Ala Gin Ala Leu Glu Pro Thr Gly Glu Glu 
45 1 5 10 15 

Asn Pro Phe 



<210> 33 
50 <211> 1119 



<212> DNA 

<213> Streptococcus Pyogenes 
<220> 

55 <221> CDS 

<222> (1) . . . (995) 

<400> 33 

atg eta caa tat tct caa aag tta cca aag gag ttc gcg atg tea gga 48 

60 Met Leu Gin Tyr Ser Gin Lys Leu Pro Lys Glu Phe Ala Met Ser Gly 

15 10 15 

ttt tta gaa caa cga tta ggt cac tgc eta agg cag atg gea gag aag 96 
Phe Leu Glu Gin Arg Leu Gly His Cys Leu Arg Gin Met Ala Glu Lys 
65 20 25 30 

ggg eta gag get ctt eta gtc aee eat tta aee aat agt tat tac ttg 144 
Gly Leu Glu Ala Leu Leu Val Thr His Leu Thr Asn Ser Tyr Tyr Leu 
35 40 45 
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aca ggt ttt tct gga act gca gca act gtt ttg ata acg gcc aaa cgt 
Thr Gly Phe Ser Gly Thr Ala Ala Thr Val Leu lie Thr Ala Lys Arg 
50 55 60 

cgt gtt ttg ate aca gat tea cgt tat acc ttg ctt get aaa get agt 
Arg Val Leu lie Thr Asp Ser Arg Tyr Thr Leu Leu Ala Lys Ala Ser 
65 70 75 80 

gtt gag gga ttt gat att ate gaa age egc acg ccg ctt aag gtt gtg 
Val Glu Gly Phe Asp lie lie Glu Ser Arg Thr Pro Leu Lys Val Val 
85 90 95 

gca gaa ttg tta gag get gat caa ata gat tgc ctt ggt ttt gag gae 
Ala Glu Leu Leu Glu Ala Asp Gin lie Asp Cys Leu Gly Phe Glu Asp 
100 105 110 

eag gta teg ttt tct ttt tac cag gcc atg caa gca gaa etg tea gga 
Gin Val Ser Phe Ser Phe Tyr Gin Ala Met Gin Ala Glu Leu Ser Gly 
115 120 125 

ata ace ttg ctt get eag tea ggt ttt gtg gag eat tta cgt ctt att 
He Thr Leu Leu Ala Gin Ser Gly Phe Val Glu His Leu Arg Leu He 
130 135 140 

aag gae gcc tct gaa ate gat acc att get aaa gcg tgc teg ate tea 
Lys Asp Ala Ser Glu He Asp Thr He Ala Lys Ala Cys Ser He Ser 
145 150 155 160 

gae aaa gca ttt gaa gat get ctt gat ttt att aaa cea ggg aca ace 
Asp Lys Ala Phe Glu Asp Ala Leu Asp Phe He Lys Pro Gly Thr Thr 
165 170 175 

act gaa cgt gae etg get aat ttt tta gat ttt cgt atg cgt cag tat 
Thr Glu Arg Asp Leu Ala Asn Phe Leu Asp Phe Arg Met Arg Gin Tyr 
180 185 190 

ggt gee age ggc aca tea ttt gat ate att gta get tea ggc tat etc 
Gly Ala Ser Gly Thr Ser Phe Asp He He Val Ala Ser Gly Tyr Leu 
195 200 205 

tct gcc atg cct eat gga egc gcc agt gae aag gtt ate cag aat aaa 
Ser Ala Met Pro His Gly Arg Ala Ser Asp Lys Val He Gin Asn Lys 
210 215 220 

gag age ttg acc atg gae ttt ggg tgt tac tac aat cac tat gtt agt 
Glu Ser Leu Thr Met Asp Phe Gly Cys Tyr Tyr Asn His Tyr Val Ser 
225 230 235 240 

gat atg acg agg acc att eat att eat att ggc caa gtt act gat gaa 
Asp Met Thr Arg Thr He His He His He Gly Gin Val Thr Asp Glu 
245 250 255 

gaa cgt gag att tat get ctt gtt ctt get get aat aag get tta att 
Glu Arg Glu He Tyr Ala Leu Val Leu Ala Ala Asn Lys Ala Leu He 
260 265 270 

get aaa get age get ggc atg act tat agt gae ttt gae ggt att ccg 
Ala Lys Ala Ser Ala Gly Met Thr Tyr Ser Asp Phe Asp Gly He Pro 
275 280 285 

egc caa etc ate act gag gcg ggt tat ggc agt egc ttc aca eat ggc 
Arg Gin Leu He Thr Glu Ala Gly Tyr Gly Ser Arg Phe Thr His Gly 
290 295 300 

att ggt cat ggc ate ggg ctt gae ate cat gag aat cea ttt ttt ggg 
He Gly His Gly He Gly Leu Asp He His Glu Asn Pro Phe Phe Gly 
305 310 315 320 
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aaa tct gag caa ctt etc caa get gga atg gtg gt aacagatgag 1005 
Lys Ser Glu Gin Leu Leu Gin Ala Gly Met Val 
325 330 

5 ccaggtatct atttggataa caaatatggt gtccgtattg aagatgactt ggttatcaca 1065 
aaactggctt gtcaagtctt gaccttggca cccaaagaat taattgtatt gtaa 1119 

<210> 34 
<211> 332 
10 <212> PRT 

<213> Streptococcus Pyogenes 



<400> 34 
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Gin 


Tyr 


Ser 


Gin 


Lys 
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Pro 
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Glu 


Phe 


Ala 


Met 


Ser 


Gly 


1 e 
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Phe 


Leu 


Glu 


Gin 


Arg 


Leu 


Gly 


His 


Cys 


Leu 


Arg 


Gin 


Met 


Ala 


Glu 


Lys 










20 










25 










30 






Gly 


Leu 


Glu 


Ala 


Leu 


Leu 


Val 


Thr 


His 


Leu 


Thr 


Asn 


Ser 


Tyr 


Tyr 


Leu 








35 










40 










45 




20 


Thr 


Gly 


Phe 


Ser 


Gly 


Thr 


Ala 


Ala 


Thr 


Val 


Leu 


He 


Thr 


Ala 


Lys 


Arg 






50 










55 










60 








Arg 


Val 


Leu 


He 


Thr 


Asp 


Ser 


Arg 


Tyr 


Thr 


Leu 


Leu 


Ala 


Lys 


Ala 


Ser 




65 










70 










75 








80 




Val 


Glu 


Gly 


Phe 


Asp 


He 


He 


Glu 


Ser 


Arg 


Thr 


Pro 


Leu 


Lys 


Val 


Val 


25 










85 










90 








95 






Ala 


Glu 


Leu 


Leu 


Glu 


Ala 


Asp 


Gin 


He 


Asp 


Cys 


Leu 


Gly 


Phe 


Glu 


Asp 










100 










105 










110 






Gin 


Val 


Ser 


Phe 


Ser 


Phe 


Tyr 


Gin 


Ala 


Met 


Gin 


Ala 


Glu 


Leu 


Ser 


Gly 








115 










120 










125 






30 


lie 


Thr 




Leu 


Ala 


Gin 


Ser 


Gly 


Phe 








Leu 


Arg 


Leu 


xxe 






130 










135 










140 










Lys 


Asp 


Ala 


Ser 


Glu 


He 


Asp 


Thr 


He 


Ala 


Lys 


Ala 


Cys 


Ser 


He 


Ser 




145 










150 










155 










160 




Asp 


Lys 


Ala 


Phe 


Glu 


Asp 


Ala 


Leu 


Asp 


Phe 


He 


Lys 


Pro 


Gly 


Thr 


Thr 


35 










165 










170 










175 






Thr 


Glu 


Arg 


Asp 
180 


Leu 


Ala 


Asn 


Phe 


Leu 
185 


Asp 


Phe 


Arg 


Met 


Arg 
190 


Gin 


Tyr 




Gly 


Ala 


Ser 


Gly 


Thr 


Ser 


Phe 


Asp 


He 


He 


Val 


Ala 


Ser 


Gly 


Tyr 


Leu 








195 










200 










205 




40 


Ser 


Ala 


Met 


Pro 


His 


Gly 


Arg 


Ala 


Ser 


Asp 


Lys 


Val 


He 


Gin 


Asn 


Lys 
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215 










220 










Glu 


Ser 


Leu 


Thr 


Met 


Asp 


Phe 


Gly 


Cys 


Tyr 


Tyr 


Asn 


His 


Tyr 


Val 


Ser 




225 










230 










235 










240 
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Met 


Thr 


Arg 


Thr 


He 


His 


He 


His 


He 


Gly 


Gin 


Val 


Thr 


Asp 


Glu 


45 










245 










250 










255 






Glu 


Arg 


Glu 


He 
260 


Tyr 


Ala 


Leu 


Val 


Leu 
265 


Ala 


Ala 


Asn 


Lys 


Ala 
270 


Leu 


He 




Ala 


Lys 


Ala 
275 


Ser 


Ala 


Gly 


Met 


Thr 
280 


Tyr 


Ser 


Asp 


Phe 


Asp 
285 


Gly 


He 


Pro 


50 


Arg 


Gin 
290 


Leu 


He 


Thr 


Glu 


Ala 
295 


Gly 


Tyr 


Gly 


Ser 


Arg 
300 


Phe 


Thr 


His 


Gly 




He 


Gly 


His 


Gly 


He 


Gly 


Leu 


Asp 


He 


His 


Glu 


Asn 


Pro 


Phe 


Phe 


Gly 
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315 
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Lys 


Ser 


Glu 


Gin 


Leu 


Leu 


Gin 


Ala 


Gly 


Met 


Val 


Val 
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